Abstract

Index coding is a source coding problem in which a broadcaster seeks to meet the different demands of several users, each of whom is assumed to have some prior information on the data held by the sender. A well-known application is to satellite communications, as described in one of the earliest papers on the subject. It is readily seen that if the sender has knowledge of its clients' requests and their side-information sets, then the number of packet transmissions required to satisfy all users' demands can be greatly reduced if the data is encoded before sending. The collection of side-information indices as well as the indices of the requested data is described as an instance $\mathcal{I}$ of the index coding with side-information (ICSI) problem. The encoding function is called the index code of $\mathcal{I}$, and the number of transmissions employed by the code is referred to as its length. The main ICSI problem is to determine the optimal length of an index code for an instance $\mathcal{I}$. As this number is hard to compute, bounds approximating it are sought, as are algorithms to compute efficient index codes. These questions have been addressed by several authors [?], [4], [5], [?], [33], [37], often taking a graph-theoretic approach. Two interesting generalizations of the problem that have appeared in the literature are the subject of this work. The first of these is the case of index coding with coded side information [?], [?], in which linear combinations of the source data are both requested by and held as users' side-information. This generalization has applications, for example, to relay channels and necessitates algebraic rather than combinatorial methods. The second is the introduction of error-correction in the problem, in which the broadcast channel is subject to noise [11]. In this paper we characterize the optimal length of a scalar or vector linear index code with coded side information (ICCSI) over a finite field in terms of a generalized min-rank and give bounds on this number based on constructions of random codes for an arbitrary instance. We furthermore consider the length of an optimal $\delta$-error correcting code for an instance of the ICCSI problem and obtain bounds analogous to those described in [11], both for the Hamming metric and for rank-metric errors. We describe decoding algorithms for both categories of errors based on those given in [11], [35].

Error Correction for Index Coding With Coded Side Information


I. Introduction 

The problem of index coding with side information (ICSI) was introduced by Birk and Kol in [?] under the term informed
source coding on demand. In [?] the authors explicitly refer to the problem as index coding. This topic is motivated by
applications in broadcast communications such as audio and video on-demand, content delivery, and wireless networking. It 
relates to a problem of source coding with side information, in which receivers have partial information about the data to be 
sent prior to its broadcast. The problem for the sender is to exploit knowledge of the users’ side information to encode data 
optimally, that is, to reduce the overall length of the encoding or, equivalently, the number of transmitted packets. The ICSI
problem has since become the subject of several studies and generalizations [?].

The scenario of the ICSI problem is the following. A server (sender) has to broadcast some data to a set of clients (receivers 
or users), with possibly different messages requested by different clients. Before the transmission starts, each receiver already 
has some data in its possession, its cached packets, called its side-information. These packets may be from a previous broadcast, 
perhaps sent during lighter data traffic periods, or acquired by some other communication. The receivers let the sender know 
which messages they have, and which they require. The broadcaster can use this information, along with encoding, to reduce 
the overall number of packet transmissions required to satisfy all the demands of its clients. If the sender has been successful 
in this endeavour, then the broadcast data can be used by each user, along with its cached packets, in order to decode its
own specific demand. 

The main index coding problem is to determine the minimum number of packet transmissions required by the sender in 
order to satisfy all users' requests, if encoding of data is permitted. Given an instance of the ICSI problem, Bar-Yossef et al. [?] proved that finding the best scalar linear binary index code is equivalent to finding the min-rank of a graph, which is known to be an NP-hard problem [?]. The twin problem is to determine an explicit optimal encoding function for an instance. Any encoding function for an instance necessarily gives an upper bound on the optimal length of an index code. A number of papers address this aspect of the problem, in fact finding sub-optimal but feasible solutions, using linear programming (LP) methods to obtain partitions of the users into solvable subsets. Such solutions involve obtaining clique covers, partial-clique covers, multicast partitions and some variants of these [?], [?], [33], [34], [37]. Other than these LP approaches, low-rank matrix completion methods may also be applied. This was considered for index coding over the real numbers in [22].

The importance of the index coding problem can also be seen in its equivalences and connections to other problems, such 
as network coding, coded caching and interference alignment [?]. These equivalences mean that results in
index coding have an impact in such other areas, and vice versa.

In [34], [?] the authors give a generalization of the index coding problem in which both demanded packets and locally
cached packets may be linear combinations of some set of data packets. We refer to this as the index coding with coded side 
information problem (ICCSI). This represents a significant departure from the ICSI problem in that an ICCSI instance no 
longer has an obvious association to a graph, digraph or hypergraph, as in the ICSI case. However, as we show here, it turns 
out that many of the results for index coding have natural extensions in the ICCSI problem. 

One motivation for the ICCSI generalization is related to the coded-caching problem. The method in [16] uses uncoded cache placement, but the authors give an example to show that coded cache placement performs better in general. In [?] it is shown that in a small cache size regime, when the number of users is not less than the number of files, a scheme based on coded cache placement is optimal. Moreover, in [?] the authors show that the only way to improve the scheme given in [16] is by coded cache placement.

Another motivation is toward applications for wireless networks with relay helper nodes and cloud storage systems (see [?] and the references therein). Consider the example in Table I. We have a scenario with one sender and four receivers $U_1$, $U_2$, $U_3$ and $U_4$. The source node has four packets $X_1$, $X_2$, $X_3$ and $X_4$, and for $i = 1,\dots,4$ user $U_i$ wants packet $X_i$. Each transmitted packet is subject to independent erasures. It is assumed that there are feedback channels from the users, informing the transmitting node which packets are successfully received. At the beginning, in time slots 1, 2, 3 and 4, the source node transmits packets $X_1$, $X_2$, $X_3$ and $X_4$, respectively. After time slot 4 we have the following setting: $U_1$ has packet $X_2$, $U_2$ has packet $X_1$, $U_3$ has packet $X_4$ and $U_4$ has packet $X_3$. Now from the classical ICSI problem we have that receivers $U_1$ and $U_2$ form a clique in the associated graph, so we can satisfy their requests by sending $X_1 + X_2$. Similarly, for $U_3$ and $U_4$ we can use $X_3 + X_4$. So the source node transmits the coded packets $X_1 + X_2$ and $X_3 + X_4$ in time slots 5 and 6, intending that the users receive the respective packets. However, $U_1$ and $U_2$ receive only the coded packet $X_3 + X_4$, while $U_3$ and $U_4$ receive only $X_1 + X_2$. At this point, if only the uncoded packets in their caches are used, we still need to send two packets. If all packets in their caches are used, the source only needs to transmit one coded packet $X_1 + X_2 + X_3 + X_4$ in time slot 7. If all four users receive this last transmission successfully, then all users can decode the required packets by linearly combining it with the
packets received earlier.

Time slot | Packet sent           | Received by U_1? | Received by U_2? | Received by U_3? | Received by U_4?
1         | X_1                   | no               | yes              | no               | no
2         | X_2                   | yes              | no               | no               | no
3         | X_3                   | no               | no               | no               | yes
4         | X_4                   | no               | no               | yes              | no
5         | X_1 + X_2             | no               | no               | yes              | yes
6         | X_3 + X_4             | yes              | yes              | no               | no
7         | X_1 + X_2 + X_3 + X_4 | yes              | yes              | yes              | yes

TABLE I

Illustration of utilizing coded packets as side information.


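[Figure: the sender is about to broadcast $X_1 + X_2 + X_3 + X_4$. $U_1$ has $X_2$, $X_3 + X_4$ and wants $X_1$; $U_2$ has $X_1$, $X_3 + X_4$ and wants $X_2$; $U_3$ has $X_4$, $X_1 + X_2$ and wants $X_3$; $U_4$ has $X_3$, $X_1 + X_2$ and wants $X_4$.]

Fig. 1. State of network after time slot 6.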
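The decoding step in time slot 7 can be checked mechanically. The following is a minimal sketch, assuming packets over $\mathbb{F}_2$ so that packet addition is bitwise XOR; the payload values and the helper names `combine` and `decode` are illustrative assumptions and do not come from the paper.

```python
# Packets over F_2 are modelled as integers; adding packets is bitwise XOR.
# The payload values below are arbitrary illustrative data.
packets = {1: 0b1010, 2: 0b0111, 3: 0b1100, 4: 0b0001}

def combine(indices):
    """Return the XOR (sum over F_2) of the packets with the given indices."""
    value = 0
    for j in indices:
        value ^= packets[j]
    return value

# Cache contents after time slot 6, read off Table I:
# each user holds one uncoded packet and one coded packet.
caches = {
    1: [(2,), (3, 4)],  # U_1 has X_2 and X_3 + X_4
    2: [(1,), (3, 4)],  # U_2 has X_1 and X_3 + X_4
    3: [(4,), (1, 2)],  # U_3 has X_4 and X_1 + X_2
    4: [(3,), (1, 2)],  # U_4 has X_3 and X_1 + X_2
}

broadcast = combine([1, 2, 3, 4])  # the single transmission in time slot 7

def decode(user):
    """Cancel every cached packet from the broadcast to isolate the demand."""
    value = broadcast
    for cached in caches[user]:
        value ^= combine(cached)
    return value

decoded = {user: decode(user) for user in caches}
print(decoded == packets)  # True: every U_i recovers X_i
```

Each user removes its two cached packets from the broadcast, which leaves exactly the packet it requested; without the coded cache entries two further transmissions would be needed.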


A second generalization of the ICSI problem was given in [11], where the authors consider error correction. That is, the broadcast channel may be subject to noise during a transmission. Classical coding theory plays a role in several of the results and a number of bounds are given on the optimal length of an error correcting index code (ECIC) that corrects some $\delta$ Hamming errors. A decoding algorithm based on syndrome decoding is also described. We remark that error-correction for network coding has only been addressed for multicast, where rank-metric and subspace codes are proposed. There are numerous papers on this subject after the seminal works [25], [?]. Many of these are based on Gabidulin codes [20].

A. Our Contribution 

In this paper we develop the theory of index coding further along the lines of these latter mentioned generalizations. That 
is, we consider the ICCSI problem both in the error-free case and with respect to error correction. We assume that the source 
data is composed of $n$ blocks of length $t$ over $\mathbb{F}_q$ (the finite field of $q$ elements), and that encoding involves taking $\mathbb{F}_q$-linear combinations of the $n$ data blocks. In particular, we consider both linear and scalar-linear index codes. We describe a generalized min-rank for the ICCSI problem, which we show gives the optimal length for $\mathbb{F}_q$-linear encodings. This quantity is actually shown to be the minimum rank weight of the coset of an $\mathbb{F}_q$-linear matrix code determined by an ICCSI instance. We characterize necessary and sufficient conditions for a matrix $L$ to realize an instance of the ICCSI problem and use this to obtain upper bounds on the length of an optimal $\mathbb{F}_q$-linear index code with coded side information. The first of these may
be viewed as a generalization of the bound obtained by the existence of a partial clique in the side-information graph of a 
classical index coding problem. It requires q to be large although it does not rely on the use of a maximum distance separable 
(MDS) code. The second of these bounds offers a refinement and relaxation of the constraint on q and is not explicit. Both 
are based on the probability that an arbitrary matrix realizes a code for an instance. 

Following the work of [11], we consider error correction for the ICCSI problem, both for the Hamming and rank metric, and address the main index coding problem for error correcting index codes. We establish criteria for error correction for an ICCSI instance and give bounds on the optimal length of a $\delta$-error correcting ECIC, both for the Hamming metric and the rank metric. These results are extensions of the $\kappa$, $\alpha$, sphere-packing and Singleton bounds as described in [11]. Some of these also yield further upper bounds on the optimal length of an ICCSI code for the error-free case.

Finally, we outline decoding strategies for linear ECICs for both the Hamming and rank distance. In the first case we extend the syndrome decoding method to correct Hamming errors for index codes given in [11] to the ICCSI case. In the second, we show that the simple, low-complexity strategy for additive matrix channels given in [35] can be applied to correct rank-metric errors, that is, to handle error matrices of rank upper bounded by some $\delta$.

II. Preliminaries 

We establish notation to be used throughout the paper. For any positive integer $n$, we let $[n] := \{1,\dots,n\}$. We write $\mathbb{F}_q$ to denote the finite field of order $q$ and use $\mathbb{F}_q^{n\times t}$ to denote the vector space of all $n \times t$ matrices over $\mathbb{F}_q$. Given a matrix $X \in \mathbb{F}_q^{n\times t}$ we write $X_i$ and $X^j$ to denote the $i$th row and $j$th column of $X$, respectively. More generally, for subsets $S \subset [n]$ and $T \subset [t]$ we write $X_S$ and $X^T$ to denote the $|S| \times t$ and $n \times |T|$ submatrices of $X$ comprised of the rows of $X$ indexed by $S$ and the columns of $X$ indexed by $T$ respectively. We write $\langle X \rangle$ to denote the row space of $X$.

In this work we will consider two distance functions, namely the Hamming metric and the rank metric, over the $\mathbb{F}_q$-vector space $\mathbb{F}_q^{n\times t}$.

Choosing a basis of the finite field $\mathbb{F}_{q^t}$ of $q^t$ elements over $\mathbb{F}_q$, it is easy to see that $\mathbb{F}_{q^t}^n$ and $\mathbb{F}_q^{n\times t}$ are isomorphic as $\mathbb{F}_q$-vector spaces. Then, given the usual definition of the Hamming distance between a pair of elements $x, y \in \mathbb{F}_{q^t}^n$:

$d_H(x,y) := |\{i : x_i \neq y_i\}|,$

we define the Hamming distance between a pair of matrices $X, Y \in \mathbb{F}_q^{n\times t}$ as the number of coordinates $i \in [n]$ such that $X_i \neq Y_i$, that is, the number of differing rows of $X$ and $Y$.

For two matrices $A, B \in \mathbb{F}_q^{n\times t}$ the rank distance between $A$ and $B$ is the rank of the matrix $A - B$ over $\mathbb{F}_q$:

$d_{\mathrm{rk}}(A,B) = \mathrm{rk}(A - B).$

We write $d(A,B)$ to denote either distance function between $A$ and $B$ and we write $w(A)$ to denote $d(A,0)$. Given $A$ and a set $\mathcal{S}$, $d(A,\mathcal{S}) = \min\{d(A,S) : S \in \mathcal{S}\}$. In some cases we will specify explicitly which distance function should be understood, otherwise the reader should interpret $d$ or $w$ as denoting either metric.
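To make the two metrics concrete, here is a small sketch that computes both distances for matrices over $\mathbb{F}_2$. The restriction to $q = 2$ and the helper names `gf2_rank`, `hamming_distance` and `rank_distance` are assumptions made for illustration only.

```python
def gf2_rank(rows):
    """Rank over F_2 of a matrix given as a list of 0/1 rows."""
    basis = []  # reduced rows, stored as integers
    for row in rows:
        v = sum(bit << k for k, bit in enumerate(row))
        for b in basis:
            v = min(v, v ^ b)  # clears the leading bit of b from v if present
        if v:
            basis.append(v)
    return len(basis)

def hamming_distance(X, Y):
    """Number of row indices i with X_i != Y_i."""
    return sum(1 for x_row, y_row in zip(X, Y) if x_row != y_row)

def rank_distance(A, B):
    """Rank over F_2 of A - B (subtraction over F_2 is XOR)."""
    diff = [[a ^ b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]
    return gf2_rank(diff)

X = [[1, 1], [1, 1], [1, 1]]
Y = [[0, 0], [0, 0], [0, 0]]
print(hamming_distance(X, Y), rank_distance(X, Y))  # 3 1
```

The example shows how far apart the two metrics can be: all three rows differ, yet the difference matrix has rank one.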

Recall that for any pair of subspaces $U$ and $V$, their sum is the subspace $U + V = \{u + v : u \in U, v \in V\}$ and we write $U \oplus V$ to denote the direct sum $U \oplus V = \{(u, v) : u \in U, v \in V\}$. Moreover $U + V$ and $U \oplus V$ are isomorphic if and only if $U \cap V$ is the trivial space. For arbitrary $x$ in the ambient space, the coset of $U$ containing $x$ is $x + U := \{x + u : u \in U\}$. We use the standard notation $U \leq V$ to denote that $U$ is a subspace of $V$.


III. Index Coding with Coded Side Information

In [34] the authors generalized the index coding problem so that coded packets of a data matrix $X$ may be broadcast or part of a user's cache. As mentioned before, this finds applications in broadcast channels with helper relay nodes.

Before we present the model with coded side information, let us recall the scenario for uncoded side information (see [?], [?]). In that case, the data is a vector $X \in \mathbb{F}_q^n$ possessed by a single sender. There are $m$ users or receivers, each of which has an index set $\mathcal{X}_i \subset [n]$, called its side-information. This indicates that the $i$th user possesses the entries of $X$ indexed by $\mathcal{X}_i$. The surjection $f : [m] \to [n]$ assigns users to indices, indicating that User $i$ wants $X_{f(i)}$, and it is also assumed that $f(i) \notin \mathcal{X}_i$. The sender is assumed to be informed of the values $f(i)$ and $\mathcal{X}_i$ of each user.

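We now describe an instance of index coding with coded side information. There is a data matrix $X \in \mathbb{F}_q^{n\times t}$ and a set of $m$ receivers or users. $X$ is thus a list of $n$ blocks of length $t$ over $\mathbb{F}_q$. For each $i \in [m]$, the $i$th user seeks some linear combination of the rows of $X$, say $R_iX$ for some $R_i \in \mathbb{F}_q^n$. We'll refer to $R_i$ as the request vector and to $R_iX$ as the request packet of User $i$. A user's cache denotes locally stored data, which it can freely access. In our model it is represented by a pair of matrices

$V^{(i)} \in \mathbb{F}_q^{d_i\times n}, \quad \Lambda^{(i)} \in \mathbb{F}_q^{d_i\times t}$

related by the equation

$\Lambda^{(i)} = V^{(i)}X.$

While the matrix $X$ may be unknown to User $i$, it is assumed that any vector in the row spaces of $V^{(i)}$ and $\Lambda^{(i)}$ can be generated at the $i$th receiver. We denote these respective row spaces by $\mathcal{X}^{(i)} := \langle V^{(i)} \rangle$ and $\mathcal{L}^{(i)} := \langle \Lambda^{(i)} \rangle$ for each $i$. The side information of the $i$th user is $(\mathcal{X}^{(i)}, \mathcal{L}^{(i)})$. Similarly, the sender $S$ has the pair of row spaces $(\mathcal{X}^{(S)}, \mathcal{L}^{(S)})$ for matrices

$V^{(S)} \in \mathbb{F}_q^{d_S\times n}, \quad \Lambda^{(S)} = V^{(S)}X \in \mathbb{F}_q^{d_S\times t}$

and does not necessarily possess the matrix $X$ itself.

The $i$th user requests a coded packet $R_iX \in \mathcal{L}^{(S)}$ with $R_i \in \mathcal{X}^{(S)} \setminus \mathcal{X}^{(i)}$. We denote by $R$ the $m \times n$ matrix over $\mathbb{F}_q$ with $i$th row equal to $R_i$. The matrix $R$ thus represents the requests of all $m$ users. We denote by

$\mathcal{X} := \{A \in \mathbb{F}_q^{m\times n} : A_i \in \mathcal{X}^{(i)},\ i \in [m]\},$

so that $\mathcal{X} = \oplus_{i\in[m]} \mathcal{X}^{(i)}$ is the direct sum of the $\mathcal{X}^{(i)}$ as a vector space over $\mathbb{F}_q$.

We define $\hat{\mathcal{X}} := \{Z \in \mathbb{F}_q^{m\times n} : Z_i \in \mathcal{X}^{(S)},\ i \in [m]\}$, which may be viewed as the direct sum of $m$ copies of $\mathcal{X}^{(S)}$.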

Remark III.1. The reader will observe that the classical ICSI problem is indeed a special case of the index coding problem with coded side information (cf. [?], [?]). Setting $V^{(S)}$ to be the $n \times n$ identity matrix, $R_i = e_{f(i)} \in \mathbb{F}_q^n$ and $V^{(i)}$ to be the $d_i \times n$ matrix with rows $V^{(i)}_j = e_{i_j}$ for each $i_j \in \mathcal{X}_i$, yields $\mathcal{X}^{(i)} = \langle e_j : j \in \mathcal{X}_i \rangle$. Then User $i$ has the rows of $X$ indexed by $\mathcal{X}_i$ and requests $X_{f(i)}$.

Remark III.2. The case where the sender does not necessarily possess the matrix $X$ itself can be applied to the broadcast relay channel, as described in [?]. The authors consider a channel as in Fig. 2 and assume that the relay is close to the users and far away from the source, and in particular that all relay-user links are erasure-free. Each node is assumed to have some storage capacity and stores previously received data in its cache. The packets in the cache of the relay node are obtained as previous broadcasts, hence it may contain both coded and uncoded packets. The relay node, playing the role of the sender, transmits packets obtained by linearly combining the packets in its cache, depending on the requests and coded side information of all users. It seeks to minimize the total number of broadcasts such that all users' demands are met.

Fig. 2. Broadcast Relay Channel

Definition III.3. An instance of the Index Coding with Coded Side Information (ICCSI) problem is a list $\mathcal{I} = (t, m, n, \mathcal{X}, \mathcal{X}^{(S)}, R)$ for some positive integers $t, m, n$, subspaces $\mathcal{X}^{(S)}$ and $\mathcal{X}^{(i)}$ of $\mathbb{F}_q^n$ of dimensions $d_S, d_i$ for $i \in [m]$ such that $\mathcal{X} = \oplus_{i\in[m]} \mathcal{X}^{(i)}$, and a matrix $R$ in $\hat{\mathcal{X}}$ (so that each row $R_i$ lies in $\mathcal{X}^{(S)}$).

For the remainder, we let $t, m, n, \mathcal{X}, \mathcal{X}^{(S)}, \hat{\mathcal{X}}, R$ be as described above and we fix $\mathcal{I} = (t, m, n, \mathcal{X}, \mathcal{X}^{(S)}, R)$ to denote an instance of the ICCSI problem for these parameters. We now define what is meant by an index code for an instance $\mathcal{I}$: it is essentially a map that encodes any data matrix $X$ in such a way that each user, given its side-information and received transmission, can uniquely determine its requested packet $R_iX \in \mathbb{F}_q^t$.

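Definition III.4. Let $N$ be a positive integer. We say that the map

$E : \mathbb{F}_q^{n\times t} \to \mathbb{F}_q^{N\times t}$

is an $\mathbb{F}_q$-code for $\mathcal{I}$ of length $N$ if for each $i$th receiver, $i \in [m]$, there exists a decoding map

$D_i : \mathbb{F}_q^{N\times t} \times \mathcal{X}^{(i)} \to \mathbb{F}_q^t,$

satisfying

$\forall X \in \mathbb{F}_q^{n\times t} : D_i(E(X), A) = R_iX,$

for some vector $A \in \mathcal{X}^{(i)}$, in which case we say that $E$ is an $\mathcal{I}$-IC. $E$ is called an $\mathbb{F}_q$-linear $\mathcal{I}$-IC if $E(X) = LV^{(S)}X$ for some $L \in \mathbb{F}_q^{N\times d_S}$, in which case we say that $L$ represents the code $E$, or that the matrix $L$ realizes $E$. If $t = 1$, we say that $L$ represents a scalar linear index code. If $t > 1$ we say that the code is vector linear. We write $\mathcal{L}$ to denote the space $\langle LV^{(S)} \rangle$.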

An encoding is sought such that the length $N$ of the $\mathcal{I}$-IC is as small as possible. We shall be principally concerned with $\mathbb{F}_q$-linear codes for an instance $\mathcal{I}$. We assume that the side information matrices of all users are known to the sender, along with the demand vectors $R_i$. As we'll see in the next section, this knowledge is sufficient to determine an encoding matrix $L$ for an $\mathbb{F}_q$-linear $\mathcal{I}$-IC. These assumptions are in keeping with those outlined in [?] for the original informed source coding on demand problem and are based on the existence of a slow error-free reverse channel allowing communication from users to the sender. We also assume that $LV^{(S)}$ is known to the receivers before the broadcast of the encoded matrix $LV^{(S)}X$. This knowledge, along with the transmission $LV^{(S)}X$ and its own cache data, will be used by each $i$th user in order to compute its demand $R_iX$. These assumptions mean that the gains of encoding an ICCSI instance are greater as $t$ increases.


A. Necessary and Sufficient Conditions for Realization of an $\mathbb{F}_q$-Linear $\mathcal{I}$-IC

In the following we give necessary and sufficient conditions for a matrix $L$ to represent a linear code of the instance $\mathcal{I}$ (in fact the sufficiency of the statement of Lemma III.5 has already been noted in [?]).

Lemma III.5. Let $L \in \mathbb{F}_q^{N\times d_S}$. Then $L$ represents an $\mathbb{F}_q$-linear $\mathcal{I}$-IC of length $N$ if and only if for each $i \in [m]$,

$R_i \in \mathcal{L} + \mathcal{X}^{(i)}.$

Proof: Let $i \in [m]$ and let $R_i \in \mathcal{X}^{(S)}$. Suppose that $Y = LV^{(S)}X$ has been transmitted. If $R_i \in \mathcal{L} + \mathcal{X}^{(i)}$ then there exist $A \in \mathbb{F}_q^{d_i}$, $B \in \mathbb{F}_q^N$ such that $R_i = AV^{(i)} + BLV^{(S)}$. Then for any $X \in \mathbb{F}_q^{n\times t}$ we have

$R_iX = AV^{(i)}X + BLV^{(S)}X = A\Lambda^{(i)} + BY.$

Therefore, Receiver $i$, knowing $LV^{(S)}$ and $\Lambda^{(i)}$, can compute $A$ and $B$ and hence acquires $R_iX$.

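Conversely, suppose that $R_i \notin \mathcal{L} + \mathcal{X}^{(i)}$. Then for each $U \in \mathbb{F}_q^t$, we have

$$\operatorname{rank}\begin{bmatrix} R_i & U \\ V^{(i)} & \Lambda^{(i)} \\ LV^{(S)} & Y \end{bmatrix} = 1 + \operatorname{rank}\begin{bmatrix} V^{(i)} & \Lambda^{(i)} \\ LV^{(S)} & Y \end{bmatrix} = 1 + \operatorname{rank}\begin{bmatrix} V^{(i)} \\ LV^{(S)} \end{bmatrix} = \operatorname{rank}\begin{bmatrix} R_i \\ V^{(i)} \\ LV^{(S)} \end{bmatrix}. \qquad (1)$$

In particular, the linear system

$R_iX = U, \quad V^{(i)}X = \Lambda^{(i)}, \quad LV^{(S)}X = Y$

is consistent for each $U \in \mathbb{F}_q^t$. It follows that

$\Pr(R_iX = U \mid V^{(i)}X = \Lambda^{(i)}, LV^{(S)}X = Y) = \frac{1}{q^t},$

so the side information and the received transmission convey no information about $R_iX$ to the $i$th receiver.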


Lemma III.5 simply says that the demands of all users can be simultaneously satisfied if and only if for each $i$ the smallest vector space containing both $\mathcal{L}$ and $\mathcal{X}^{(i)}$ also contains $R_i$; in other words, extending each side-information space by the same space $\mathcal{L}$ yields a space that contains the $i$th request vector. This is achieved, for example, if $\mathcal{L} + \mathcal{X}^{(i)}$ is the whole space $\mathbb{F}_q^n$ for each $i$, although this is clearly not necessary.
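The membership condition of Lemma III.5 is a rank test, which the sketch below applies to the instance of Table I after time slot 6. It assumes $q = 2$, $t = 1$ and $V^{(S)}$ equal to the identity (so that $LV^{(S)} = L$); the function names `gf2_rank` and `realizes` are hypothetical helpers introduced for this illustration.

```python
def gf2_rank(rows):
    """Rank over F_2 of a matrix given as a list of 0/1 rows."""
    basis = []
    for row in rows:
        v = sum(bit << k for k, bit in enumerate(row))
        for b in basis:
            v = min(v, v ^ b)  # clears the leading bit of b from v if present
        if v:
            basis.append(v)
    return len(basis)

def realizes(LV, V_users, R):
    """Check R_i in <LV> + <V^(i)> for every user i (Lemma III.5), over F_2."""
    for V_i, R_i in zip(V_users, R):
        span = LV + V_i  # stack the rows of L V^(S) on top of V^(i)
        if gf2_rank(span + [R_i]) != gf2_rank(span):
            return False  # R_i is not in the row space of the stack
    return True

# Side information of U_1..U_4 after time slot 6 (Table I).
V_users = [
    [[0, 1, 0, 0], [0, 0, 1, 1]],  # U_1: X_2, X_3 + X_4
    [[1, 0, 0, 0], [0, 0, 1, 1]],  # U_2: X_1, X_3 + X_4
    [[0, 0, 0, 1], [1, 1, 0, 0]],  # U_3: X_4, X_1 + X_2
    [[0, 0, 1, 0], [1, 1, 0, 0]],  # U_4: X_3, X_1 + X_2
]
R = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

print(realizes([[1, 1, 1, 1]], V_users, R))  # True
print(realizes([[1, 1, 0, 0]], V_users, R))  # False
```

Sending $X_1 + X_2 + X_3 + X_4$ satisfies every user, whereas resending $X_1 + X_2$ fails at $U_3$, whose cache already contains that packet.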


An equivalent formulation of the statement of Lemma III.5 is to say that $L$ represents a linear index code for $\mathcal{I}$ if and only if $\mathcal{L}$ meets each coset $R_i + (\mathcal{X}^{(i)} \cap \mathcal{X}^{(S)}) = \{R_i + A : A \in \mathcal{X}^{(i)} \cap \mathcal{X}^{(S)}\}$. We will use this view to obtain an upper bound on the optimal length of a linear index code in Theorem III.18.

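Given an $r \times n$ matrix $A \in \mathbb{F}_q^{r\times n}$, we write $A^{\perp}$ to denote the null space of $A$ in $\mathbb{F}_q^n$. Furthermore, for each $i \in [m]$ we define the sets:

$\mathcal{Y}^{(i)} := \{Y \in \mathbb{F}_q^{n\times t} : V^{(i)}Y = 0\} = (V^{(i)\perp})^t,$
$\mathcal{Z}^{(i)} := \{Z \in \mathbb{F}_q^{n\times t} : V^{(i)}Z = 0,\ R_iZ \neq 0\}.$

To help put these sets in context, if $V^{(i)}$ has rows composed of standard basis vectors, say with leading ones indexed by the set $S^{(i)} \subset [n]$ (which means the side-information of User $i$ is uncoded), then $\mathcal{Y}^{(i)}$ consists of those matrices whose rows indexed by $S^{(i)}$ are all-zero. Then $\mathcal{Y}^{(i)}$ can be identified with the set $[n] \setminus \mathcal{X}_i$, the complement of the side information of user $i$, and $\mathcal{Z}^{(i)}$ can be identified with $[n] \setminus \mathcal{X}_i \cup \{f(i)\}$.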

Remark III.6. In the classical ICSI problem, two data matrices $X$ and $X'$ are called confusable at receiver $i$ (cf. [?]) if they yield the same side information for $i$, i.e. $X_j = X'_j$ for all $j \in \mathcal{X}_i$, and if moreover the packets $X_{f(i)}$ and $X'_{f(i)}$ are different (here $\mathcal{X}_i$ represents the side information of the receiver $i$ and $f(i)$ the index of its request packet). In the ICCSI problem, two matrices $X, X'$ are called confusable at receiver $i$ if $V^{(i)}X = V^{(i)}X'$ and $R_iX \neq R_iX'$, i.e. if they yield the same side information for the $i$th user but the requested data packets are different. Therefore, $X$ and $X'$ are confusable at receiver $i$ if and only if $X - X'$ lies in the set $\mathcal{Z}^{(i)}$.

The essential content of the next result, which follows from Lemma III.5, is that $L$ represents a linear code of $\mathcal{I}$ if and only if any confusable pair $X, X'$ results in different encodings. Therefore, another way of stating Corollary III.7 is:

$L$ represents an $\mathbb{F}_q$-linear $\mathcal{I}$-IC if and only if $LV^{(S)}X \neq LV^{(S)}X'$ for any confusable pair $X, X' \in \mathbb{F}_q^{n\times t}$.

Then $L$ realizes an $\mathbb{F}_q$-linear $\mathcal{I}$-IC if and only if no matrix of $\mathcal{Z}^{(i)}$, $i \in [m]$, vanishes after multiplication by $LV^{(S)}$, so the sets $\mathcal{Z}^{(i)}$ may be used to characterize all linear codes of $\mathcal{I}$. Of course $LV^{(S)}Z$ is non-zero if and only if it has positive weight. This result will be generalized further in Theorem IV.2 to give a criterion for error-correction.

Corollary III.7. Let $L \in \mathbb{F}_q^{N\times d_S}$. Then $L$ represents an $\mathbb{F}_q$-linear $\mathcal{I}$-IC of length $N$ if and only if $LV^{(S)}Z \neq 0$ for each $i \in [m]$ and $Z \in \mathcal{Z}^{(i)}$.


Proof: Fix some $i \in [m]$ and let $Z_0 \in \mathcal{Z}^{(i)}$; let $LV^{(S)}Z_0 = W$. Suppose that $R_i \notin \mathcal{L} + \mathcal{X}^{(i)}$. Then as in the proof of Lemma III.5 the linear system

$R_iZ = U, \quad V^{(i)}Z = 0, \quad LV^{(S)}Z = W \qquad (2)$

is consistent for every choice of $U \in \mathbb{F}_q^t$. In particular, (2) has a solution $Z_1$ for $U = 0$. Then $Z = Z_0 - Z_1 \in \mathcal{Z}^{(i)}$ and $LV^{(S)}Z = 0$. We have shown that if $L$ does not represent a linear code for $\mathcal{I}$ then for some $i$, there exists $Z \in \mathcal{Z}^{(i)}$ such that


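$LV^{(S)}Z = 0$. Applying the contrapositive, this yields that if $\operatorname{rank}(LV^{(S)}Z) \geq 1$ (i.e. if $LV^{(S)}Z \neq 0$) for each $i \in [m]$ and $Z \in \mathcal{Z}^{(i)}$, then $L$ represents a linear index code for the instance $\mathcal{I}$.

Conversely, if there exist $A \in \mathbb{F}_q^{d_i}$, $B \in \mathbb{F}_q^N$ such that $R_i = AV^{(i)} + BLV^{(S)}$ then

$R_iZ = AV^{(i)}Z + BLV^{(S)}Z = BLV^{(S)}Z \neq 0,$

for any $Z \in \mathcal{Z}^{(i)}$, so that $LV^{(S)}Z \neq 0$. ■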
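For small parameters the criterion of Corollary III.7 can be verified by exhaustive search over $\mathcal{Z}^{(i)}$. The sketch below does this for $q = 2$ and $t = 1$ on the Table I instance, assuming $V^{(S)}$ is the identity; `dot` and `realizes_by_confusability` are hypothetical names, and the search is exponential in $n$, so it is only meant as an illustration.

```python
from itertools import product

def dot(u, v):
    """Inner product of two 0/1 vectors over F_2."""
    return sum(a * b for a, b in zip(u, v)) % 2

def realizes_by_confusability(LV, V_users, R):
    """Corollary III.7 for t = 1 over F_2: no Z in Z^(i) may satisfy LV Z = 0."""
    n = len(R[0])
    for V_i, R_i in zip(V_users, R):
        for Z in product([0, 1], repeat=n):
            # Z lies in Z^(i) when V^(i) Z = 0 and R_i Z != 0.
            in_Z_i = all(dot(row, Z) == 0 for row in V_i) and dot(R_i, Z) == 1
            if in_Z_i and all(dot(row, Z) == 0 for row in LV):
                return False  # a confusable difference is encoded as zero
    return True

V_users = [
    [[0, 1, 0, 0], [0, 0, 1, 1]],
    [[1, 0, 0, 0], [0, 0, 1, 1]],
    [[0, 0, 0, 1], [1, 1, 0, 0]],
    [[0, 0, 1, 0], [1, 1, 0, 0]],
]
R = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

print(realizes_by_confusability([[1, 1, 1, 1]], V_users, R))  # True
print(realizes_by_confusability([[1, 1, 0, 0]], V_users, R))  # False
```

For the second matrix the vector $Z = (0,0,1,0)^T$ lies in $\mathcal{Z}^{(3)}$ and is annihilated by $L$, which is exactly the failure the corollary predicts.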


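B. The Optimal Length of an $\mathbb{F}_q$-Linear $\mathcal{I}$-IC

We extend the definition of the min-rank of an instance of the ICSI problem, as given in [?], to the ICCSI problem. We will show that this characterizes the shortest possible length of an $\mathbb{F}_q$-linear $\mathcal{I}$-IC.

Definition III.8. We define the min-rank of the instance $\mathcal{I}$ of the ICCSI problem over $\mathbb{F}_q$ to be

$\kappa(\mathcal{I}) = \min\{\operatorname{rank}(A + R) : A \in \mathbb{F}_q^{m\times n},\ A_i \in \mathcal{X}^{(i)} \cap \mathcal{X}^{(S)} \subset \mathbb{F}_q^n,\ \forall i \in [m]\}.$

Observe that the quantity $\kappa(\mathcal{I})$ is $d_{\mathrm{rk}}(R, \mathcal{X} \cap \hat{\mathcal{X}})$, which is the rank-distance of $R \in \mathbb{F}_q^{m\times n}$ to the $\mathbb{F}_q$-linear code $\mathcal{X} \cap \hat{\mathcal{X}}$ (the matrices whose $i$th row lies in $\mathcal{X}^{(i)} \cap \mathcal{X}^{(S)}$), or equivalently the minimum rank-weight of the coset $R + (\mathcal{X} \cap \hat{\mathcal{X}}) \subset \mathbb{F}_q^{m\times n}$.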

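We now show that given the instance $\mathcal{I}$, the minimum length of an $\mathbb{F}_q$-linear $\mathcal{I}$-IC is given by its min-rank.

Lemma III.9. The length of an optimal $\mathbb{F}_q$-linear $\mathcal{I}$-IC is $\kappa(\mathcal{I})$.

Proof: Let $L \in \mathbb{F}_q^{N\times d_S}$ have rank $N$. From Lemma III.5, $L$ represents a linear code of length $N$ if and only if for each $i \in [m]$ there exist $A_i \in \mathcal{X}^{(i)} \cap \mathcal{X}^{(S)} \subset \mathbb{F}_q^n$, $B_i \in \mathbb{F}_q^N$ such that

$R_i = B_iLV^{(S)} - A_i$

(i.e. if and only if $R_i \in \mathcal{L} + \mathcal{X}^{(i)}$ for each $i$). Equivalently, this holds if and only if there exist matrices $A \in \mathcal{X} \cap \hat{\mathcal{X}}$, $B \in \mathbb{F}_q^{m\times N}$ such that $R = BLV^{(S)} - A$, in which case we have $BLV^{(S)} = R + A$ in the coset $R + (\mathcal{X} \cap \hat{\mathcal{X}})$. In particular, we have shown that a matrix $L \in \mathbb{F}_q^{N\times d_S}$ represents an $\mathbb{F}_q$-linear code for $\mathcal{I}$ only if

$BLV^{(S)} \in R + (\mathcal{X} \cap \hat{\mathcal{X}})$

for some $B \in \mathbb{F}_q^{m\times N}$, so every such $L$ has rank at least $\kappa(\mathcal{I})$.

Now let $A \in \mathbb{F}_q^{m\times n}$ with $A_i \in \mathcal{X}^{(i)} \cap \mathcal{X}^{(S)}$ for each $i \in [m]$. Suppose that $A + R$ has rank $N$. Since $A, R \in \hat{\mathcal{X}}$, there exists $Z \in \mathbb{F}_q^{m\times d_S}$ satisfying $A + R = ZV^{(S)}$. Furthermore, there exist $B \in \mathbb{F}_q^{m\times N}$ and $L \in \mathbb{F}_q^{N\times d_S}$ such that $Z = BL$.

Then

$R = BLV^{(S)} - A$

so $L$ represents a linear code of length $N$ for the instance $\mathcal{I}$. The length $N$ is minimized for $N = \kappa(\mathcal{I})$, so there exists some $L$ of rank $\kappa(\mathcal{I})$ representing a linear code for $\mathcal{I}$. ■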

Lemma III.9 gives a naive algorithm for computation of a matrix $L$ for an optimal linear $\mathcal{I}$-IC: put each element of $R + (\mathcal{X} \cap \hat{\mathcal{X}})$ into row-echelon form and choose one of minimal rank $N = \kappa(\mathcal{I})$. The non-zero rows of this matrix yield the required $N \times d_S$ matrix $L$. We do not suggest this as a practical approach, since it requires $O(d_S q^{\ell})$ operations, with $\ell = \dim(\mathcal{X} \cap \hat{\mathcal{X}})$. We mention this here to give a concrete illustration of the realization problem. As already observed in [?], the min-rank $\kappa(\mathcal{I})$ of the instance $\mathcal{I}$ generalizes the notion of the min-rank of the so-called side-information graph of the classical index coding problem, which is NP-hard to compute. A discussion on the various approaches to obtaining bounds on the optimal length of an index code can be read in [33], where the authors assert that graph-theoretic methods for constructing index coding schemes yield bounds on the optimal length of an index code, which are often out-performed by the min-rank. In fact all of these so-called graph-theoretic methods, which use linear programming methods to obtain (possibly sub-optimal) solutions to the linear index coding problem, can be extended to the ICCSI case. These results have been outlined in a separate forthcoming paper [?].
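The exhaustive search over the coset can be sketched as follows for the Table I instance. This assumes $q = 2$ and $V^{(S)}$ equal to the identity, so that $\mathcal{X}^{(i)} \cap \mathcal{X}^{(S)} = \mathcal{X}^{(i)}$; the helper names `gf2_rank`, `row_space` and `min_rank` are hypothetical, and the enumeration grows as $q^{\ell}$, so it is only feasible for toy parameters.

```python
from itertools import product

def gf2_rank(rows):
    """Rank over F_2 of a matrix given as a list of 0/1 rows."""
    basis = []
    for row in rows:
        v = sum(bit << k for k, bit in enumerate(row))
        for b in basis:
            v = min(v, v ^ b)  # clears the leading bit of b from v if present
        if v:
            basis.append(v)
    return len(basis)

def row_space(V):
    """All F_2-linear combinations of the rows of V, as tuples."""
    vectors = set()
    for coeffs in product([0, 1], repeat=len(V)):
        vec = [0] * len(V[0])
        for c, row in zip(coeffs, V):
            if c:
                vec = [a ^ b for a, b in zip(vec, row)]
        vectors.add(tuple(vec))
    return sorted(vectors)

def min_rank(V_users, R):
    """Minimum of rank(A + R) over all A whose i-th row lies in <V^(i)>."""
    best = None
    for A in product(*(row_space(V_i) for V_i in V_users)):
        coset_element = [[a ^ r for a, r in zip(A_i, R_i)] for A_i, R_i in zip(A, R)]
        rank = gf2_rank(coset_element)
        best = rank if best is None else min(best, rank)
    return best

V_users = [
    [[0, 1, 0, 0], [0, 0, 1, 1]],
    [[1, 0, 0, 0], [0, 0, 1, 1]],
    [[0, 0, 0, 1], [1, 1, 0, 0]],
    [[0, 0, 1, 0], [1, 1, 0, 0]],
]
R = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

print(min_rank(V_users, R))  # 1
```

The value 1 matches the single coded transmission of time slot 7: the coset contains the matrix all of whose rows equal $(1,1,1,1)$.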


C. Upper Bounds on the Optimal Length of an $\mathbb{F}_q$-Linear $\mathcal{I}$-IC

We now give upper bounds on $\kappa(\mathcal{I})$, applying probabilistic arguments. The main results are Corollary III.13 and Theorem III.18, both of which show that with certain constraints on $N$, there exists an $\mathbb{F}_q$-linear $\mathcal{I}$-IC of length $N$. While both results essentially give lower bounds on the probability that a random $N \times d_S$ matrix $L$ represents an $\mathbb{F}_q$-linear $\mathcal{I}$-IC, the key point is that these probabilities are positive, so that existence is guaranteed. It is from this observation that upper bounds on $\kappa(\mathcal{I})$ are achieved.

We will use the following theorem proved by Zippel [38] (see also [?], [?]). We state it here for finite fields.

Theorem III.10. Let $m, s$ be positive integers with $q > m$ and let $P(x_1,\dots,x_s)$ be a non-zero multivariate polynomial in $\mathbb{F}_q[x_1,\dots,x_s]$ for which the largest exponent of any variable $x_i$ is at most $m$. If $(a_1,\dots,a_s)$ is chosen uniformly at random in $\mathbb{F}_q^s$ then the probability that $P(a_1,\dots,a_s)$ equals zero is at most $1 - (1 - m/q)^s$.

Remark III.11. Before proving the following theorem, we note that if $X_1,\dots,X_n$ are independent uniformly distributed random variables that take their values over a field $\mathbb{F}_q$, then the random variable

$Z_\ell = \sum_{i=1}^{\ell} a_iX_i$

for some $\ell \in [n]$, $a_i \in \mathbb{F}_q^{\times} = \mathbb{F}_q \setminus \{0\}$, has a uniform distribution.

This is easily shown by an inductive argument. Clearly $P(Z_1 = \beta) = \frac{1}{q}$ for any $\beta \in \mathbb{F}_q$ since $a_1 \neq 0$. Moreover, for any $\ell \in [n]$, $\beta \in \mathbb{F}_q$,

$P(Z_\ell = \beta) = P(Z_{\ell-1} = \beta - a_\ell X_\ell) = \sum_{\gamma \in \mathbb{F}_q} P(X_\ell = \gamma)\,P(Z_{\ell-1} = \beta - a_\ell\gamma) = \frac{1}{q}.$

Let $m'$ be the number of distinct equivalence classes of $[m]$ under the relation $i \equiv j$ if $\mathcal{X}^{(i)} = \mathcal{X}^{(j)}$. Let $\hat{m}$ be a set of $m'$ representatives for the distinct equivalence classes of $[m]$.

Theorem III.12. Let $\mathcal{I}$ be an instance of an ICCSI problem and let $N = \max\{n - d_i : i \in [m]\}$. Suppose that $q > m'$. If the entries of a matrix $L \in \mathbb{F}_q^{N\times d_S}$ are chosen uniformly at random in $\mathbb{F}_q$, then the probability that $L$ represents a linear code for $\mathcal{I}$ is at least $(1 - m'/q)^{Nd_S}$.

Proof: From Corollary III.7, if $w(LV^{(S)}Z) \geq 1$ for each $Z \in \mathcal{Z}^{(i)}$ then $L$ represents a code for $\mathcal{I}$. For each $i \in \hat{m}$, let $Z^{(i)} \in \mathbb{F}_q^{n\times k_i}$ satisfy $V^{(i)}Z^{(i)} = 0$ and have rank $k_i = n - d_i$. Write $L^{(i)} = LV^{(S)}Z^{(i)}$. The matrix $L$ represents a code for $\mathcal{I}$ if $L^{(i)}$ is a full-rank matrix for each $i \in \hat{m}$, which holds if and only if there exists a non-zero $k_i \times k_i$ minor of $L^{(i)}$. Since the entries of $L$ are uniformly distributed, so are the entries of $L^{(i)}$, from Remark III.11. Each such minor has the form $M^{(i)} = \sum_{\sigma} \operatorname{sgn}(\sigma) \prod_{j=1}^{k_i} L^{(i)}_{j,\sigma(j)}$. Now $\prod_{i\in\hat{m}} M^{(i)}$ may be viewed as a polynomial in $Nd_S$ variables of degree $\sum_{i\in\hat{m}} k_i \leq m'N$ with each variable appearing with multiplicity at most $m'$ in any term. Then the probability that $L$ represents a code for $\mathcal{I}$ is the probability that $\prod_{i\in\hat{m}} M^{(i)}$ is non-zero, which from Theorem III.10 is at least $(1 - m'/q)^{Nd_S}$, for $q > m'$.

Corollary III.13. If $q > m'$ then $\kappa(\mathcal{I}) \leq \max\{n - d_i : i \in [m]\}$.

Proof: Theorem III.12 guarantees the existence of some matrix $L \in \mathbb{F}_q^{N\times d_S}$ that represents an $\mathbb{F}_q$-linear $\mathcal{I}$-IC of length $N = \max\{n - d_i : i \in [m]\}$. The result is now immediate since $N \geq \kappa(\mathcal{I})$. ■
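For a toy instance the probability in Theorem III.12 can be computed exactly and compared with the bound $(1 - m'/q)^{Nd_S}$. The sketch below assumes a prime $q$, an identity $V^{(S)}$ (so $LV^{(S)} = L$) and a hypothetical instance with $n = m = 2$ that is not taken from the paper; `rank_mod_p` and `realizes` are illustrative helpers, and realizability is tested with the rank condition of Lemma III.5.

```python
from fractions import Fraction
from itertools import product

def rank_mod_p(rows, p):
    """Rank over the prime field F_p by Gaussian elimination."""
    rows = [[x % p for x in row] for row in rows]
    rank = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(rank, len(rows)) if rows[r][col]), None)
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        inv = pow(rows[rank][col], -1, p)  # modular inverse (Python 3.8+)
        rows[rank] = [(x * inv) % p for x in rows[rank]]
        for r in range(len(rows)):
            if r != rank and rows[r][col]:
                f = rows[r][col]
                rows[r] = [(x - f * y) % p for x, y in zip(rows[r], rows[rank])]
        rank += 1
    return rank

def realizes(LV, V_users, R, p):
    """Lemma III.5: R_i must lie in <LV> + <V^(i)> for every user i."""
    for V_i, R_i in zip(V_users, R):
        span = LV + V_i
        if rank_mod_p(span + [R_i], p) != rank_mod_p(span, p):
            return False
    return True

# Hypothetical toy instance: U_1 has X_1 and wants X_2, U_2 has X_2 and wants X_1.
q, n, d_S = 3, 2, 2
V_users = [[[1, 0]], [[0, 1]]]
R = [[0, 1], [1, 0]]
N = max(n - len(V_i) for V_i in V_users)  # rows of each V^(i) are independent
m_prime = 2                               # the two side-information spaces differ

good = sum(
    realizes([list(flat[k * d_S:(k + 1) * d_S]) for k in range(N)], V_users, R, q)
    for flat in product(range(q), repeat=N * d_S)
)
probability = Fraction(good, q ** (N * d_S))
bound = (1 - Fraction(m_prime, q)) ** (N * d_S)
print(probability, bound)  # 4/9 1/9
```

Here a $1 \times 2$ matrix $L = (a, b)$ works exactly when $a$ and $b$ are both non-zero, so the true probability $4/9$ comfortably exceeds the guaranteed $1/9$; the bound is an existence tool rather than a sharp estimate.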

Remark III.14. In fact Schwartz’s result 4321? gives the lower bound ofl— Qfi probability of an N x ds matrix 

L representing an X-IC, where d is the average of the {di : i G [m]}. While this may give a higher lower bound, it places the 


restriction m! (n — d) < q and so in particular yields a weaker version of Corollary III.13 

Remark III.15. Note that if for some i, Z is non-zero for any Z G y^''\ then it satisfies the deco ding cri terion for any 


possible request vector $R_i \in \mathcal{X}^{(S)}$, and hence delivers all possible requests to User $i$. Therefore, Theorem III.12 and Corollary III.13 should be viewed in the context of similar results in [37], which lead to partial clique-cover and partition multicast


schemes. There is also a close association with the so-called Main Network Coding Theorem ifTOl Theorem 2.2] for multicast 
network coding. All of these results rely on the field size $q$ being sufficiently large to invoke Zippel's theorem and its variants.


Remark III.16. The approach in to construct a linear IC for a partial clique is based on maximum distance separable 
(MDS) codes (this can be used also in the more general case of a multicast group, as described in Any generator matrix 

of an MDS linear code of length $n$ and dimension $k$ is such that any $k$ columns are linearly independent. Suppose that $\mathcal{I}$ is an ICSI instance and that $d_i$ is the number of uncoded packets $X_j$ known to the receiver $i$. Let $G$ be a generator matrix of an MDS code of length $n$ and dimension $N = \max\{n - d_i : i \in [m]\}$. Then the sender can broadcast the following linear combination of the columns of $G$:

$$X_1 g^1 + \cdots + X_n g^n.$$

Without loss of generality, suppose that some receiver $i$ has $X_{N+1}, \ldots, X_n$ and can thus recover

$$X_1 g^1 + \cdots + X_N g^N.$$

From the MDS property of $G$, the first $N$ columns of $G$ form an invertible matrix so that the user can determine $(X_1, \ldots, X_N)$. In terms of Lemma III.5, the side-information is encoded by a matrix $V^{(i)} = [0|I]$ whose rows are standard basis vectors. Appending the $N$ rows of $G$ then results in a matrix with row space $\mathbb{F}_q^n$. In the above we get a matrix

$$\begin{bmatrix} G \\ 0 \;\; I \end{bmatrix}$$

which has rank $n$, so any possible request vector is contained in $\langle G \rangle + \mathcal{X}^{(i)}$.

However, this approach with MDS codes is not possible in the more general case of coded side information. Indeed, suppose we have $n = m = 4$, $\mathcal{X}^{(S)} = \mathbb{F}_2^4$, the user side-information determined by

= [ 1 0 0 0 0 i],v( 3 ) = [o 1 0 o],U^) = [o o i o], 

and requests

$$R_1 = [1110], \quad R_2 = [0100], \quad R_3 = [0010], \quad R_4 = [0001].$$

If the approach using an MDS code could be applied, then the matrix

$$G = \begin{bmatrix} 1 & 0 & 0 & 1 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 1 \end{bmatrix},$$

defined over $\mathbb{F}_2$, could be used to encode a vector $X = [X_1, \ldots, X_4]^T$. It is easy to check that $R_1 \notin \langle G \rangle + \mathcal{X}^{(1)}$ and so from Lemma III.5 we have that Receiver 1 cannot decode. On the other hand, consider the matrix

$$L = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 1 \end{bmatrix},$$

defined over $\mathbb{F}_2$. Following the proof of Theorem III.12 we have the matrices

= 


■ 1 

0 

0 ■ 


■ 1 

0 

0 ■ 


■ 1 

0 

0 ■ 


■ 1 

0 

0 ■ 

0 

1 

0 


0 

1 

0 

, = 

0 

0 

0 

, Z^^'I = 

0 

1 

0 

0 

0 

1 

0 

0 

1 

0 

1 

0 

0 

0 

0 

1 

0 

0 


0 

0 

0 


0 

0 

1 


0 

0 

1 


It is easy to check that for each $i$, $L^{(i)}$ is a $3 \times 3$ matrix of full rank. Then the matrix $L$ represents a linear index code for our instance but $L$ does not generate an MDS code. In fact, direct inspection shows that $\mathcal{L} + \mathcal{X}^{(i)} = \mathbb{F}_2^4$ for each $i$, so that $L$ realizes an $\mathbb{F}_2$-linear $\mathcal{I}$-IC for any choice of $R$. This is equivalent to multicast network coding.

We can remove the explicit constraint that $q > m'$ to obtain an alternative lower bound on the probability of a realizable linear solution to the ICCSI problem. A straightforward counting argument yields the following 'folklore' result, a proof of which we include for completeness. Recall that for positive integers $s \geq r$ the Gaussian coefficient

$$\begin{bmatrix} s \\ r \end{bmatrix}_q := \prod_{i=0}^{r-1} \frac{q^s - q^i}{q^r - q^i}$$

denotes the number of $r$-dimensional subspaces contained in an $s$-dimensional space over $\mathbb{F}_q$. If $s < r$ this number is zero.
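The Gaussian coefficient is easy to evaluate exactly with integer arithmetic. The sketch below is illustrative only; the function name `gaussian_binomial` is a choice made here, not notation from the paper.

```python
def gaussian_binomial(s, r, q):
    """Number of r-dimensional subspaces of an s-dimensional space over F_q."""
    if r < 0 or r > s:
        return 0
    num = den = 1
    for i in range(r):
        num *= q ** s - q ** i
        den *= q ** r - q ** i
    return num // den  # the quotient is always an exact integer

print(gaussian_binomial(4, 2, 2))  # number of planes in F_2^4
```

For instance, $\mathbb{F}_2^4$ has $(16-1)(16-2)/((4-1)(4-2)) = 35$ two-dimensional subspaces.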

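Lemma III.17. Let $W, V, S$ be subspaces of $\mathbb{F}_q^n$ with $W \leq V \cap S$ and of dimensions $w$, $v$ and $s$ respectively. Suppose that $S \cap V$ has dimension $\ell$. The number of $N$-dimensional subspaces $U$ of $S$ satisfying $V \cap U \leq W$ is

$$\sum_{r=0}^{w} q^{(\ell-r)(N-r)} \begin{bmatrix} w \\ r \end{bmatrix}_q \begin{bmatrix} s-\ell \\ N-r \end{bmatrix}_q .$$

Proof: Let $M$ be an $r$-dimensional subspace of $V \cap S$. A basis $\{m_1, \ldots, m_r\}$ of $M$ can be completed to a linearly independent $N$-set by appending some $m_{r+1}, \ldots, m_N \in S \setminus V$ in

$$\prod_{j=\ell}^{\ell+N-r-1} (q^s - q^j) = q^{\ell(N-r)} \prod_{j=0}^{N-r-1} (q^{s-\ell} - q^j)$$

ways. There are $(q^N - q^r) \cdots (q^N - q^{N-1})$ choices of $m_{r+1}, \ldots, m_N$ in $(M + \langle m_{r+1}, \ldots, m_N \rangle) \setminus M$. Therefore there are

$$q^{(\ell-r)(N-r)} \begin{bmatrix} s-\ell \\ N-r \end{bmatrix}_q$$

$N$-dimensional subspaces of $S$ that meet $V$ in a given $r$-dimensional subspace of $V$. The result now follows since there are $\begin{bmatrix} w \\ r \end{bmatrix}_q$ $r$-dimensional subspaces of $W$. ■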

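We now apply Lemma III.17 for the case $W = \mathcal{X}^{(i)} \cap \mathcal{X}^{(S)}$, $V = \langle R_i, \mathcal{X}^{(i)} \rangle$ to count the number of subspaces $\mathcal{L} \leq \mathcal{X}^{(S)}$ whose intersection with $V$ is contained in $W$. This will tell us the number of subspaces $\mathcal{L}$ such that $R_i \notin \mathcal{L} + \mathcal{X}^{(i)}$.

Theorem III.18. Let $\mathcal{I}$ be an instance of an ICCSI problem. For each $i \in [m]$, let $\dim(\mathcal{X}^{(i)} \cap \mathcal{X}^{(S)}) = w_i$. The probability that there exists an $N$-dimensional subspace $\mathcal{L}$ of $\mathcal{X}^{(S)}$ such that for each $i \in [m]$, $R_i \in \mathcal{L} + \mathcal{X}^{(i)}$, is at least

$$1 - \begin{bmatrix} d_S \\ N \end{bmatrix}_q^{-1} \sum_{i=1}^{m} \sum_{r=0}^{w_i} q^{(w_i+1-r)(N-r)} \begin{bmatrix} w_i \\ r \end{bmatrix}_q \begin{bmatrix} d_S - w_i - 1 \\ N-r \end{bmatrix}_q .$$

In particular, there exists a linear $\mathcal{I}$-IC of length $N$ if

$$\sum_{i=1}^{m} \sum_{r=0}^{w_i} q^{(w_i+1-r)(N-r)} \begin{bmatrix} w_i \\ r \end{bmatrix}_q \begin{bmatrix} d_S - w_i - 1 \\ N-r \end{bmatrix}_q < \begin{bmatrix} d_S \\ N \end{bmatrix}_q .$$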
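Proof: Let $i \in [m]$. An $N$-dimensional subspace $\mathcal{L}$ of $\mathcal{X}^{(S)}$ satisfies $R_i \in \mathcal{L} + \mathcal{X}^{(i)}$ if and only if $(R_i + \mathcal{X}^{(i)}) \cap \mathcal{L}$ is non-empty. The number of $N$-dimensional subspaces of $\mathcal{X}^{(S)}$ that miss $R_i + \mathcal{X}^{(i)}$ is the number of $\mathcal{L}$ that meet $\langle R_i, \mathcal{X}^{(i)} \rangle$ in a subspace of $\mathcal{X}^{(i)}$. Since $R_i \in \mathcal{X}^{(S)}$ by assumption, $\langle R_i, \mathcal{X}^{(i)} \rangle \cap \mathcal{X}^{(S)}$ has dimension $w_i + 1$. Then from Lemma III.17 the number of $N$-dimensional subspaces $\mathcal{L}$ in $\mathcal{X}^{(S)}$ that miss $R_i + \mathcal{X}^{(i)}$ (i.e. the number satisfying $R_i \notin \mathcal{L} + \mathcal{X}^{(i)}$) is

$$\sum_{r=0}^{w_i} q^{(w_i+1-r)(N-r)} \begin{bmatrix} w_i \\ r \end{bmatrix}_q \begin{bmatrix} d_S - w_i - 1 \\ N-r \end{bmatrix}_q .$$

The probability that an arbitrary $N$-dimensional subspace of $\mathcal{X}^{(S)}$ misses $R_i + \mathcal{X}^{(i)}$ is this number divided by $\begin{bmatrix} d_S \\ N \end{bmatrix}_q$. From the union bound, the probability that an arbitrary $N$-dimensional subspace of $\mathcal{X}^{(S)}$ misses $R_i + \mathcal{X}^{(i)}$ for some $i \in [m]$ is upper bounded by

$$\begin{bmatrix} d_S \\ N \end{bmatrix}_q^{-1} \sum_{i=1}^{m} \sum_{r=0}^{w_i} q^{(w_i+1-r)(N-r)} \begin{bmatrix} w_i \\ r \end{bmatrix}_q \begin{bmatrix} d_S - w_i - 1 \\ N-r \end{bmatrix}_q .$$
■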
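The bound of Theorem III.18 can be evaluated exactly with rational arithmetic. The sketch below assumes the reading of the formula in which user $i$ contributes $\sum_{r} q^{(w_i+1-r)(N-r)} \begin{bmatrix} w_i \\ r \end{bmatrix}_q \begin{bmatrix} d_S-w_i-1 \\ N-r \end{bmatrix}_q$ missing subspaces; the function names are illustrative. Terms with $r > N$ vanish, so the inner sum stops at $\min(w_i, N)$.

```python
from fractions import Fraction

def gaussian_binomial(s, r, q):
    """Number of r-dimensional subspaces of an s-dimensional space over F_q."""
    if r < 0 or r > s:
        return 0
    num = den = 1
    for i in range(r):
        num *= q ** s - q ** i
        den *= q ** r - q ** i
    return num // den

def missing_subspaces(q, N, d_S, w):
    """Count of N-dim subspaces of X^(S) that miss R_i + X^(i), per Lemma III.17."""
    return sum(
        q ** ((w + 1 - r) * (N - r))
        * gaussian_binomial(w, r, q)
        * gaussian_binomial(d_S - w - 1, N - r, q)
        for r in range(min(w, N) + 1)
    )

def theorem_III_18_bound(q, N, d_S, ws):
    """Lower bound on the success probability; ws lists w_i for each user."""
    total = gaussian_binomial(d_S, N, q)
    return 1 - Fraction(sum(missing_subspaces(q, N, d_S, w) for w in ws), total)

# Parameters of Example III.21: q = 2, d_S = 4, six users with w_i = 2.
print(theorem_III_18_bound(2, 3, 4, [2] * 6))  # length N = 3
print(theorem_III_18_bound(2, 2, 4, [2] * 6))  # length N = 2
```

A non-positive value means the bound is inconclusive for that length, not that no code exists.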


Remark III.19. In the above argument, by invoking the union bound, we have assumed the most extreme case. For each $i$, let $S^{(i)}$ denote the set of $N$-dimensional subspaces $\mathcal{L}$ of $\mathcal{X}^{(S)}$ such that

$$\mathcal{X}^{(i)} \cap \mathcal{L} = \langle R_i, \mathcal{X}^{(i)} \rangle \cap \mathcal{L}.$$

Then the probability of a decoding failure is maximized when the size of the union of the $S^{(i)}$ is maximized. If the $S^{(i)}$ are pairwise disjoint we have

$$\Big| \bigcup_{i \in [m]} S^{(i)} \Big| = \sum_{i \in [m]} |S^{(i)}|$$

and so the given bound is sharp. This occurs if no $N$-dimensional subspace is contained in the intersection of any pair of the $S^{(i)}$. Moreover, there is no length $N$ $\mathbb{F}_q$-linear $\mathcal{I}$-IC if and only if $\bigcup_{i \in [m]} S^{(i)}$ contains all $N$-dimensional subspaces of $\mathcal{X}^{(S)}$.

Remark III.20. Given an $N$-dimensional subspace $\mathcal{L} \leq \mathcal{X}^{(S)}$, Theorem III.18 often yields a better lower bound on the probability that $L \in \mathbb{F}_q^{N \times d_S}$ satisfying $\mathcal{L} = \langle LV^{(S)} \rangle$ represents a linear $\mathcal{I}$-IC. Furthermore, it establishes existence for the case $m > q$.

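Example III.21. Let $m = 6$, $n = 4$, $q = 2$ and let $\mathcal{X}^{(S)} = \mathbb{F}_2^4$. Suppose that $\mathcal{X}^{(i)}$ has dimension $d_i = 2$ for each $i \in \{1, \ldots, 6\}$. According to Theorem III.18 the probability of the existence of a $3 \times 4$ matrix $L$ over $\mathbb{F}_2$ that represents a linear $\mathcal{I}$-IC with these parameters is at least 0.2, but is inconclusive for the existence of a $2 \times 4$ encoding matrix $L$. Indeed there are such $\mathcal{I}$-IC satisfying $\kappa(\mathcal{I}) = 3 > 2 = \max\{n - d_i : i \in [6]\}$, for example, $\mathcal{I}$ with user side-information determined by

$$V^{(1)} = \begin{bmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad V^{(2)} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad V^{(3)} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{bmatrix},$$

$$V^{(4)} = \begin{bmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix}, \quad V^{(5)} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix}, \quad V^{(6)} = \begin{bmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix},$$

and requests

$$R_1 = [1000], \; R_2 = [0100], \; R_3 = [0010], \; R_4 = [0001], \; R_5 = [0100], \; R_6 = [1000]$$

has min-rank equal to 3.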

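On the other hand, there are several examples of $\mathcal{I}$ for the same parameters that have min-rank equal to 2, such as that defined by:

$$V^{(1)} = \begin{bmatrix} 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad V^{(2)} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 \end{bmatrix}, \quad V^{(3)} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{bmatrix},$$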
 q   N   d   m   Theorem III.12     Theorem III.18
 4   1   9   2   0.001              0.500
 4   1   9   3   9.536 x 10^-7      0.250
 4   1   9   4   -                  2.861 x 10^-6
 4   2   8   2   9.536 x 10^-7      0.523
 4   2   8   3   9.094 x 10^-13     0.285
 4   2   8   4   -                  0.046
 4   3   7   2   9.313 x 10^-10     0.528
 4   3   7   3   8.673 x 10^-19     0.293
 4   3   7   4   -                  0.057
 4   4   6   2   9.094 x 10^-13     0.530
 4   4   6   3   8.271 x 10^-25     0.295
 4   4   6   4   -                  0.060
 4   5   5   2   8.881 x 10^-16     0.532
 4   5   5   3   7.888 x 10^-31     0.296
 4   5   5   4   -                  0.061
 4   6   4   2   8.673 x 10^-19     0.532
 4   6   4   3   7.523 x 10^-37     0.298
 4   6   4   4   -                  0.064
 4   7   3   2   8.470 x 10^-22     0.536
 4   7   3   3   7.174 x 10^-43     0.304
 4   7   3   4   -                  0.072
 4   8   2   2   8.271 x 10^-25     0.553
 4   8   2   3   6.842 x 10^-49     0.329
 4   8   2   4   -                  0.106
 4   9   1   2   8.077 x 10^-28     0.624
 4   9   1   3   6.525 x 10^-55     0.437
 4   9   1   4   -                  0.249
 4   9   1   5   -                  0.062

TABLE II


 q    N   d   m     Theorem III.18
 4    2   9   16    1.4305 x 10^-5
 4    3   8   16    0.0117
 4    4   7   16    0.0148
 4    5   6   16    0.0162
 4    6   5   16    0.0191
 4    7   4   16    0.0299
 4    8   3   17    0.0155
 4    9   2   21    0.0156
 8    2   9   64    5.8673 x 10^-8
 8    3   8   64    0.0017
 8    4   7   64    0.0019
 8    5   6   64    0.0019
 8    6   5   64    0.0021
 8    7   4   64    0.0038
 8    8   3   65    0.0019
 8    9   2   73    0.0019
 16   2   9   256   2.3192 x 10^-10
 16   3   8   256   0.0002
 16   4   7   256   0.0002
 16   5   6   256   0.0002
 16   6   5   256   0.0002
 16   7   4   256   0.0004
 16   8   3   257   0.0002
 16   9   2   273   0.0002

TABLE III


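$$V^{(4)} = \begin{bmatrix} 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{bmatrix}, \quad V^{(5)} = \begin{bmatrix} 1 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad V^{(6)} = \begin{bmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix},$$

and requests

$$R_1 = [1100], \; R_2 = [0111], \; R_3 = [1010], \; R_4 = [1001], \; R_5 = [0100], \; R_6 = [1111].$$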
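The two min-rank claims of Example III.21 are small enough to check by exhaustive search. The sketch below assumes the side-information matrices and requests as read from the example (the extraction is damaged, so the data in `INSTANCE_A` and `INSTANCE_B` should be treated as an assumption), with vectors encoded as 4-bit integers. It uses the fact that $R_i \in \mathcal{L} + \mathcal{X}^{(i)}$ exactly when $\mathcal{L}$ meets the coset $R_i + \mathcal{X}^{(i)}$.

```python
from itertools import combinations

def span(gens):
    """All F_2-linear combinations of the generators (bitmask vectors)."""
    vecs = {0}
    for g in gens:
        vecs |= {v ^ g for v in vecs}
    return vecs

def min_rank(n, users):
    """Smallest N such that some N-dim subspace of F_2^n meets every coset R_i + X^(i)."""
    cosets = [{req ^ x for x in span(side)} for side, req in users]
    for N in range(n + 1):
        for gens in combinations(range(1, 2 ** n), N):
            L = span(gens)
            # Skip dependent generator sets; require L to hit every coset.
            if len(L) == 2 ** N and all(L & c for c in cosets):
                return N
    return None

# (rows of V^(i), request R_i), assumed from Example III.21.
INSTANCE_A = [((0b0010, 0b0001), 0b1000), ((0b1000, 0b0001), 0b0100),
              ((0b1000, 0b0100), 0b0010), ((0b0100, 0b0010), 0b0001),
              ((0b1000, 0b0010), 0b0100), ((0b0100, 0b0001), 0b1000)]
INSTANCE_B = [((0b1010, 0b0001), 0b1100), ((0b1000, 0b0011), 0b0111),
              ((0b1000, 0b0100), 0b1010), ((0b0101, 0b0010), 0b1001),
              ((0b1010, 0b0001), 0b0100), ((0b0100, 0b0001), 0b1111)]

print(min_rank(4, INSTANCE_A), min_rank(4, INSTANCE_B))
```

Brute force is only viable here because $\mathbb{F}_2^4$ has 15 non-zero vectors; it is not a general min-rank algorithm.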

We tabulate (see Table II) evaluations of the lower bound on the probability of the existence of linear $\mathcal{I}$-ICs with $\mathcal{X}^{(S)} = \mathbb{F}_q^{10}$ and length $N = 10 - d$, where $d_i = d$ for each $i$th user in $[m]$, $m = m' > 1$. In each case, we find that the maximal value of $m' = m$ for which the bound of Theorem III.18 can be applied is $m = q$. However, existence for $m > q$ can be established by Theorem III.18 for $N > n - d$. In Table III we record parameters $N$ and $d = 10 - N + 1$ of $\mathcal{I}$-IC known to exist using the bound of Theorem III.18 for a maximal number of users $m$.


IV. Error Correction in the ICCSI Problem 

We now discuss error-correction in the ICCSI problem, extending the ideas presented in in, in two ways. The first direction 
is in the context of coded-side information, as presented in Gil, Co). The second allows for error correction for the rank 
metric, in the transmission of matrices in $\mathbb{F}_q^{n \times t}$ when $t > 1$. For the remainder, we let $\mathcal{M} \subseteq \mathbb{F}_q^{n \times t}$ denote the message space associated with the ICCSI problem.


A. The ECIC Problem 

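Definition IV.1. Let $\mathcal{I}$ be an instance of an ICCSI problem and let $N$ be a positive integer. We say that the map

$$E : \mathbb{F}_q^{n \times t} \to \mathbb{F}_q^{N \times t}$$

is a $\delta$-error correcting code for $\mathcal{I}$ of length $N$, and say that $E$ is an $(\mathcal{I}, \delta)$-ECIC, if for each $i$th receiver there exists a decoding map

$$D_i : \mathbb{F}_q^{N \times t} \times \mathcal{X}^{(i)} \to \mathbb{F}_q^{t},$$

satisfying

$$D_i(E(X) + W, A) = R_i X$$

for all $X \in \mathcal{M}$ and $W \in \mathbb{F}_q^{N \times t}$, $w(W) \leq \delta$, for some vector $A \in \mathcal{X}^{(i)}$. $E$ is called a linear code for $\mathcal{I}$, or an $\mathbb{F}_q$-linear $\mathcal{I}$-ECIC, if $E(X) = LV^{(S)}X$ for some $L \in \mathbb{F}_q^{N \times d_S}$, in which case we say that $L$ represents the linear $(\mathcal{I}, \delta)$-ECIC $E$.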


The standard coding theory argument gives a criterion for the existence of an $(\mathcal{I}, \delta)$-ECIC, extending [11, Lemma 3.8]. The following result is the error correction analogue of Lemma III.5. It says that the $i$th receiver can correct up to $\delta$ errors and uniquely decode its requested packet $R_i X$ if any pair of confusable data matrices have encodings $LV^{(S)}X$ and $LV^{(S)}X'$ that are at distance at least $2\delta + 1$ apart.


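Theorem IV.2. Let $\mathcal{I}$ be an instance of an ICCSI problem and let $N$ be a positive integer. A matrix $L \in \mathbb{F}_q^{N \times d_S}$ represents a linear $(\mathcal{I}, \delta)$-ECIC if and only if for all $i \in [m]$ it holds that

$$w(LV^{(S)}(X - X')) \geq 2\delta + 1,$$

for all $X, X' \in \mathcal{M}$ such that $X - X' \in \mathcal{Z}^{(i)}$.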

Proof: Let $L \in \mathbb{F}_q^{N \times d_S}$ represent a linear $\mathcal{I}$-IC. For each $X \in \mathbb{F}_q^{n \times t}$, define

$$B(X, \delta) = \{Y : Y = LV^{(S)}X + W, \; W \in \mathbb{F}_q^{N \times t}, \; w(W) \leq \delta\}.$$

It is not hard to see by an adaptation of the usual coding theory arguments that the $i$th receiver can correct $\delta$ errors if and only if

$$B(X, \delta) \cap B(X', \delta) = \emptyset$$

for each $X, X' \in \mathcal{M}$ such that $V^{(i)}X = V^{(i)}X'$ and $R_i X \neq R_i X'$.

For the Hamming metric, the argument is almost identical to that for a classical error-correcting code. Suppose then that $t > 1$ and that $w$ measures the rank weight.

Let $X, X' \in \mathcal{M}$ such that $X - X' \in \mathcal{Z}^{(i)}$. Let $LV^{(S)}X = Z$ and let $LV^{(S)}X' = Z'$. Let $A = Z - Z'$ and suppose $w(A) = d \leq 2\delta$. We may assume that $A$ is in row-echelon form. Thus we can write $A = W + W'$ with $w(W) = \delta$ and $w(W') = d - \delta \leq \delta$, where the first $\delta$ rows of $W$ are the corresponding rows of $A$ and the others are zero, and the rows indexed by $[d] \setminus [\delta]$ in $W'$ are the corresponding rows of $A$ and the remaining $N - d + \delta$ rows are zero. That is,

$$W = \begin{bmatrix} A_{[\delta]} \\ 0 \end{bmatrix} \quad \text{and} \quad W' = \begin{bmatrix} 0 \\ A_{[d] \setminus [\delta]} \\ 0 \end{bmatrix}.$$

Then $Z - W = Z' + W' \in B(X, \delta) \cap B(X', \delta)$, so if the required spheres $B(X, \delta)$ are disjoint, $L$ represents a linear $\mathcal{I}$-ECIC. Conversely, if $B(X, \delta) \cap B(X', \delta) \neq \emptyset$ then $Z + W = Z' + W'$ for some $W$ and $W'$, each having rank at most $\delta$. Thus $Z - Z' = W' - W$ and in particular, by the triangle inequality, $w(Z - Z') \leq w(W') + w(W) \leq 2\delta$. ■
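For very small parameters the criterion of Theorem IV.2 can be tested directly. The sketch below assumes $q = 2$, $t = 1$, the Hamming weight, $\mathcal{M} = \mathbb{F}_2^n$ and $V^{(S)} = I$, and uses a hypothetical two-user instance (each user knows one packet and requests the other) that is not taken from the paper. It enumerates $\mathcal{Z}^{(i)} = \{Z : V^{(i)}Z = 0, \; R_i Z \neq 0\}$ and checks that every $LZ$ has weight at least $2\delta + 1$.

```python
from itertools import product

def dot(u, v):
    """Inner product over F_2."""
    return sum(a * b for a, b in zip(u, v)) % 2

def is_ecic(L, users, n, delta):
    """Check w(L Z) >= 2*delta + 1 for every Z in every Z^(i) (Hamming weight, t = 1)."""
    for side_rows, request in users:
        for Z in product((0, 1), repeat=n):
            in_Zi = all(dot(row, Z) == 0 for row in side_rows) and dot(request, Z) == 1
            if in_Zi and sum(dot(row, Z) for row in L) < 2 * delta + 1:
                return False
    return True

# Hypothetical instance with n = 2: user 1 knows X_2 and wants X_1, user 2 the reverse.
USERS = [([(0, 1)], (1, 0)), ([(1, 0)], (0, 1))]

print(is_ecic([(1, 1)], USERS, 2, 0))                  # plain index code X_1 + X_2
print(is_ecic([(1, 1), (1, 1), (1, 1)], USERS, 2, 1))  # the same symbol sent three times
```

Sending $X_1 + X_2$ three times corrects one error, which matches the length $\kappa(\mathcal{I}) + 2\delta = 3$ for this toy instance.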


Let Z be an instance of the ICCSI problem and let i £ [m\. Let X,X' £ M such that X — X' £ zAi. For the case f > 1, 
for any Z S F^^'^® we have 

wiLVA\x - X')) = rank(ZE(®)(A - X')) < rank(L(®)(A: - X')) < w{X - X'). 

Then L does not represent a linear (Z, 5)-ECIC if rank(A — X') < 25. We therefore assume, for f > 1, that Al is a subset 
of F”^‘ of minimum rank distance at least 25 -I- 1, and furthermore that {V^lx : X £ A4} c F^®^‘ has minimum rank 
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distance 25 + 1. Delsarte’s result ifTJl Theorem 5.4] yields that \jv[\ < g('^s-2i5)t dehne ■= {X — X' : X, X' G Xi}. 
For the Hamming metric case, we assume Xi = F”^‘. 

We dehne the following sets for any non-negative integer 5: 

yf ■= {A e : rank(A) > 25 -f 1} 

:= {A e : rank(A) > 25 -f 1} 

Clearly a matrix L G represents a linear (Z, 5)-ECIC if and only if for all i G [m] it holds that 

>25-f 1, 

for all Z G Z^'‘^ if w is the Hamming metric and for all z G n X4‘^ if w represents the rank metric. 

Remark IV.S. Note that if Tank{LV^^^Z) >25-1-1 whenever Z G has rank at least 25 -I- 1, then rank(L14(‘^)Z) > r 
whenever Z G Z^'^'i has rank at least r for r G [25 + 1]. This can be seen by the following inductive argument. Suppose 
rank(LC^'®^Z) > r whenever rank(Z) > r for some positive integer r and suppose there exists some such Z G Z^'^\ Let 
X G Z^^^ have rank r — 1 and let S be set of r — 1 linearly independent columns of X. If va.'nk.{LV^^'^X) < r — 1 then S 
cannot be completed to a linearly independent set of size r in Z^'^'i by hypothesis, Then every column of Z is contained in the 
span of S, so in particular the column space of X contains an r-dimensional space, which is impossible. 


B. Bounds on the Optimal Length of an Error Correcting Index Code 

We denote by $\mathcal{N}(\mathcal{I}, \delta)$ the optimal length $N$ of an $\mathbb{F}_q$-linear $(\mathcal{I}, \delta)$-ECIC. Clearly $\mathcal{N}(\mathcal{I}, 0) = \kappa(\mathcal{I})$. This section is devoted to obtaining bounds on this number. In [11] a number of bounds are discussed, namely the $\alpha$-bound, $\kappa$-bound and Singleton bound. All of these bounds have extensions for Hamming metric $(\mathcal{I}, \delta)$-ECICs. The $\alpha$-bound holds for rank metric $(\mathcal{I}, \delta)$-ECICs, but the question of the rank distance analogue of the $\kappa$-bound is still open. We consider these two cases separately.

1) Hamming metric $(\mathcal{I}, \delta)$-ECICs: We assume throughout this section that $w$ represents the Hamming weight and that $\mathcal{N}(\mathcal{I}, \delta)$ is the optimal length of an $\mathbb{F}_q$-linear Hamming metric $(\mathcal{I}, \delta)$-ECIC. We denote by $N(k, d)$ the optimal length $\ell$ of an $\mathbb{F}_q$-$[\ell, k, d]$ code, i.e. a $k$-dimensional $\mathbb{F}_q$-linear code in $\mathbb{F}_q^{\ell}$ of minimum Hamming distance $d$.

Thanks to the following result we can restrict our study to the case $t = 1$.

Lemma IV.4. Let t > 1. Consider two instances Z = (t,m,n, X , R) and Z' = X, X^^\ R). Then a matrix 

L G represents a linear {X,5)-ECIC if and only if L represents a linear {X',S)-ECIC. 

Proof: The matrices and the request vectors Rfs are the same for the two instances Z and X'. Let 

Zf'> ={Z G F”^‘ I V^^'^Z = 0 and RiZ 7 ® 0} and = {Z G F”''i | Z = 0 and R,Z 7 ® 0}. 

If L represents a linear (Z, 5)-ECIC, then for all Z G w{LV^^^ Z) >25-1-1, where w counts the number of non-zero 
rows. On the other hand, if L realizes a linear (Z', 5)-EClC, then for all Z G Z^^\ w(LV^^^Z) >25-1-1, where in this case 

the number of non-zero rows is the same of the non-zero entries of the x 1 vector LV^^'^Z. 

(i) 

Note that any Z G Z^ satishes 

Z = [Z\...,Z% 

with Z^ G Z^*^ U {0} for all j G [f], and at least one is different from zero. Without loss of generality, suppose Zi 7 ^ 0. Then 
if L represents a linear (Z', 5)-ECIC we have 

LV^^^Z = [LV^^'>Z\...,LV^^'>Z*] 

where the column LV^^'^Z^ has at least 25-1-1 non-zero entries, which implies that at least 25-1-1 rows of LV^^'>Z are 
non-zero. 

Conversely, let L G F^^®^® represent a linear (Z, 5)-ECIC, and let Z G Z^*^ such that 

Z=[ Z1,0,...,0 ], 


with Z^ G Z^*\ Then 


LV^^^Z = [ZE('^)z\0,...,0] 


has at least 25-1-1 non-zero rows, which means that 25-1-1 entries of the hrst column are non-zero. Therefore L represents a 
linear (Z',5)-ECIC. ■ 


For the remainder of this section, we fix $t = 1$, knowing that all the results hold also for $t > 1$ from Lemma IV.4. We define the set

$$\mathcal{J}(\mathcal{I}) := \Big\{ U \leq \mathbb{F}_q^{n} : U \setminus \{0\} \subseteq \bigcup_{i \in [m]} \mathcal{Z}^{(i)} \Big\}.$$

We denote by $\alpha(\mathcal{I})$ the maximum dimension of any element of $\mathcal{J}(\mathcal{I})$, that is, the maximum dimension of any subspace of $\mathbb{F}_q^n$ in $\bigcup_{i \in [m]} \mathcal{Z}^{(i)} \cup \{0\}$.

We first give an extended $\alpha$-bound, which gives a lower bound on the length of an optimal Hamming metric $(\mathcal{I}, \delta)$-ECIC. It may be helpful for the reader to think of this as the algebraic analogue of the independence number of a side-information graph.

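Proposition IV.5. ($\alpha$-bound) Let $\mathcal{I}$ be an instance of the ICCSI problem. Then

$$N(\alpha(\mathcal{I}), 2\delta + 1) \leq \mathcal{N}(\mathcal{I}, \delta).$$

Proof: Let $L \in \mathbb{F}_q^{N \times d_S}$ represent a linear $(\mathcal{I}, \delta)$-ECIC. Let $U \in \mathcal{J}(\mathcal{I})$ have dimension $k$ and let $G$ be a rank $k$ matrix in $\mathbb{F}_q^{n \times k}$ such that $U = \{GX : X \in \mathbb{F}_q^k\}$. Let

$$C_U = \{LV^{(S)}GX : X \in \mathbb{F}_q^k\} \subseteq \mathbb{F}_q^N.$$

Then every element of $U \setminus \{0\}$ is contained in $\mathcal{Z}^{(i)}$ for some $i \in [m]$, so $w(LV^{(S)}GX) \geq 2\delta + 1$ for all non-zero $X \in \mathbb{F}_q^k$, by assumption. This furthermore implies that $LV^{(S)}G$ has rank $k$ over $\mathbb{F}_q$. It follows that $C_U$ is an $\mathbb{F}_q$-$[N, k, 2\delta + 1]$ code with $N \geq N(k, 2\delta + 1)$. Choosing $U$ of maximal dimension in $\mathcal{J}(\mathcal{I})$ for an $(\mathcal{I}, \delta)$-ECIC of optimal length we see that

$$N(\alpha(\mathcal{I}), 2\delta + 1) \leq \mathcal{N}(\mathcal{I}, \delta).$$
■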


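We give sufficient conditions for tightness of the $\alpha$-bound.

Corollary IV.6. Let $\mathcal{I}$ be an instance of the ICCSI problem. If there exists a matrix $B \in \mathbb{F}_q^{\alpha(\mathcal{I}) \times d_S}$ satisfying $\langle BV^{(S)} \rangle^{\perp} \cap \mathcal{X}^{(i)\perp} \subseteq R_i^{\perp}$ for all $i \in [m]$ then

$$N(\alpha(\mathcal{I}), 2\delta + 1) = \mathcal{N}(\mathcal{I}, \delta).$$

Proof: Let $B \in \mathbb{F}_q^{\alpha(\mathcal{I}) \times d_S}$ satisfy the hypothesis of the corollary. Let $G$ be a generator matrix for the $\mathbb{F}_q$-linear $[N, \alpha(\mathcal{I}), 2\delta + 1]$ code $\{GX : X \in \mathbb{F}_q^{\alpha(\mathcal{I})}\}$, and let $L = GB$. Then

$$w(LV^{(S)}Z) = w(GBV^{(S)}Z) \geq 2\delta + 1$$

whenever $BV^{(S)}Z \neq 0$. If $BV^{(S)}Z = 0$, then $Z \notin \mathcal{Z}^{(i)}$ for any $i \in [m]$ by our choice of $B$, so it follows that $L$ represents an $\mathbb{F}_q$-linear Hamming metric $(\mathcal{I}, \delta)$-ECIC of length

$$N = N(\alpha(\mathcal{I}), 2\delta + 1) \geq \mathcal{N}(\mathcal{I}, \delta).$$
■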


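Setting $\delta = 0$ in the above gives the following lower bound on the min-rank of an instance as an immediate consequence.

Corollary IV.7. Let $\mathcal{I}$ be an instance of the ICCSI problem. Then

$$\alpha(\mathcal{I}) \leq \kappa(\mathcal{I}),$$

with equality occurring if there exists $L \in \mathbb{F}_q^{\alpha(\mathcal{I}) \times d_S}$ satisfying $\langle LV^{(S)} \rangle^{\perp} \cap \mathcal{X}^{(i)\perp} \subseteq R_i^{\perp}$ for all $i \in [m]$.

The reader will observe that in fact the condition $\langle LV^{(S)} \rangle^{\perp} \cap \mathcal{X}^{(i)\perp} \subseteq R_i^{\perp}$ for each $i \in [m]$ is simply the equivalent statement to that of Lemma III.5 found by dualization.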

Both the K-bound and the Singleton bound hold in the context of coded-side information. The proofs are trivial extensions 
of those given in im and are retained here only for the convenience of the reader. 

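Proposition IV.8. ($\kappa$-bound) Let $\mathcal{I}$ be an instance of the ICCSI problem. Then

$$\mathcal{N}(\mathcal{I}, \delta) \leq N(\kappa(\mathcal{I}), 2\delta + 1).$$

Proof: Let $L_1 \in \mathbb{F}_q^{N_1 \times d_S}$ represent an optimal $\mathbb{F}_q$-linear $\mathcal{I}$-IC of length $N_1 = \kappa(\mathcal{I})$. Let $L_2 \in \mathbb{F}_q^{N \times N_1}$ have rank $N_1$ such that the code

$$C = \{L_2 X : X \in \mathbb{F}_q^{N_1}\} \leq \mathbb{F}_q^N$$

is an $[N, N_1, 2\delta + 1]$ linear code over $\mathbb{F}_q$ with $N = N(N_1, 2\delta + 1)$ for some $\delta$. Since $L_1 V^{(S)}Z \in \mathbb{F}_q^{N_1}$ is non-zero for all $Z \in \mathcal{Z}^{(i)}$, $i \in [m]$,

$$w(L_2 L_1 V^{(S)}Z) \geq 2\delta + 1,$$

for all such $Z$. Then $L = L_2 L_1$ represents a linear $(\mathcal{I}, \delta)$-ECIC of length

$$N = N(\kappa(\mathcal{I}), 2\delta + 1) \geq \mathcal{N}(\mathcal{I}, \delta).$$
■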
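Proposition IV.9. (Singleton bound) Let $\mathcal{I}$ be an instance of the ICCSI problem. Then

$$\kappa(\mathcal{I}) + 2\delta \leq \mathcal{N}(\mathcal{I}, \delta).$$

Proof: Let $L \in \mathbb{F}_q^{N \times d_S}$ represent an optimal linear $(\mathcal{I}, \delta)$-ECIC over $\mathbb{F}_q$, so that $N = \mathcal{N}(\mathcal{I}, \delta)$. Let $L'$ be the matrix obtained by deleting any $2\delta$ rows of $L$. By Theorem IV.2, for each $i \in [m]$,

$$w(LV^{(S)}Z) \geq 2\delta + 1, \text{ for all } Z \in \mathcal{Z}^{(i)},$$

so that

$$w(L'V^{(S)}Z) \geq 1, \text{ for all } Z \in \mathcal{Z}^{(i)}.$$

So $L'$ is a linear index code of length $N - 2\delta$ for the instance $\mathcal{I}$. Now $L'$ has at least $\kappa(\mathcal{I})$ rows so that

$$\kappa(\mathcal{I}) \leq \mathcal{N}(\mathcal{I}, \delta) - 2\delta.$$
■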


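Example IV.10. Let $m = 6$, $n = 5$, $q = 2$ and let $\mathcal{X}^{(S)} = \mathbb{F}_2^5$. Suppose that $\mathcal{X}^{(i)}$ has dimension $d_i = 2$ for each $i \in \{1, \ldots, 6\}$. Let $\mathcal{I}$ be the instance defined by user side-information

$$V^{(1)} = \begin{bmatrix} 0 & 1 & 1 & 1 & 0 \\ 0 & 0 & 1 & 1 & 1 \end{bmatrix}, \quad V^{(2)} = \begin{bmatrix} 1 & 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 1 & 0 \end{bmatrix}, \quad V^{(3)} = \begin{bmatrix} 1 & 1 & 1 & 1 & 0 \\ 0 & 0 & 0 & 1 & 1 \end{bmatrix},$$

$$V^{(4)} = \begin{bmatrix} 1 & 0 & 0 & 1 & 0 \\ 0 & 1 & 1 & 1 & 1 \end{bmatrix}, \quad V^{(5)} = \begin{bmatrix} 0 & 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 1 & 1 \end{bmatrix}, \quad V^{(6)} = \begin{bmatrix} 1 & 0 & 0 & 1 & 0 \\ 0 & 0 & 1 & 1 & 0 \end{bmatrix},$$

and requests

$$R_1 = [10000], \; R_2 = [10000], \; R_3 = [00101], \; R_4 = [10001], \; R_5 = [11000], \; R_6 = [00111].$$

It can be checked that $\kappa(\mathcal{I}) = 3$. Moreover, $\bigcup_{i \in [6]} \mathcal{Z}^{(i)}$ contains the non-zero elements in the span of $\{(10110), (10011), (01100)\}$. Then $\alpha(\mathcal{I}) = \kappa(\mathcal{I}) = 3$. It follows from the $\alpha$-bound that $6 = N(3, 3) = N(\alpha(\mathcal{I}), 3) \leq \mathcal{N}(\mathcal{I}, 1)$. From the $\kappa$-bound we have $6 = N(3, 3) = N(\kappa(\mathcal{I}), 3) \geq \mathcal{N}(\mathcal{I}, 1)$.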
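The value $N(3, 3) = 6$ over $\mathbb{F}_2$ used in Example IV.10 can be sanity-checked in two steps. The sketch below is not from the paper: it uses the Griesmer bound $n \geq \sum_{i=0}^{k-1} \lceil d/q^i \rceil$ as a lower bound, and an explicit generator matrix (a choice made here for illustration) to show that length 6 is attainable.

```python
from itertools import product

def griesmer_bound(k, d, q):
    """Griesmer lower bound on the length of a linear [n, k, d] code over F_q."""
    return sum(-(-d // q ** i) for i in range(k))  # -(-a // b) is ceil(a / b)

def min_distance(G):
    """Minimum Hamming weight of a non-zero codeword of the binary code generated by G."""
    k, n = len(G), len(G[0])
    best = n
    for msg in product((0, 1), repeat=k):
        if any(msg):
            codeword = [sum(m * g for m, g in zip(msg, col)) % 2 for col in zip(*G)]
            best = min(best, sum(codeword))
    return best

# Illustrative generator matrix of a binary [6, 3, 3] code.
G = [[1, 0, 0, 1, 1, 0],
     [0, 1, 0, 1, 0, 1],
     [0, 0, 1, 0, 1, 1]]

print(griesmer_bound(3, 3, 2), min_distance(G))
```

Since the lower bound and the constructed length agree, any optimal binary code of dimension 3 and distance 3 has length exactly 6.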


Reed-Solomon codes and their $q$-analogues in the form of Gabidulin codes [20] are examples of MDS and maximum rank
distance (MRD) codes respectively. They were hrst introduced in ifTSll . In fact any extended generalized Reed-Solomon code 
over $\mathbb{F}_q$ is an MDS code of length $q + 1$ [23, Theorem 5.3.4] so the existence of such codes is assured for such lengths. It is conjectured that any $\mathbb{F}_q$-$[N, k, d]$ MDS code satisfies $N \leq q + 1$ unless $q$ is even and $k = 3$ or $k = q - 1$ (in which case $N \leq q + 2$).


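Corollary IV.11. Let $\mathcal{I}$ be an instance of the ICCSI problem. If $q \geq \kappa(\mathcal{I}) + 2\delta - 1$ then

$$\mathcal{N}(\mathcal{I}, \delta) = \kappa(\mathcal{I}) + 2\delta.$$

Proof: If $q \geq \kappa(\mathcal{I}) + 2\delta - 1$ then there exists an $\mathbb{F}_q$-linear $[q + 1, \kappa(\mathcal{I}), 2\delta + 1]$ MDS code, namely an extended Reed-Solomon code. Then we obtain

$$\kappa(\mathcal{I}) + 2\delta \leq \mathcal{N}(\mathcal{I}, \delta) \leq N(\kappa(\mathcal{I}), 2\delta + 1) = \kappa(\mathcal{I}) + 2\delta.$$
■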
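The outer code needed in Corollary IV.11 can be illustrated with a plain Reed-Solomon code. The sketch below makes several simplifying assumptions that are not in the paper: the field is a prime field $\mathbb{F}_p$ (so arithmetic is ordinary modular arithmetic), the code is a non-extended Reed-Solomon code of length $N \leq p$, and the parameters $p = 7$, $\kappa = 3$, $\delta = 1$ are hypothetical. It checks by exhaustion that the code of length $\kappa + 2\delta$ has minimum distance $2\delta + 1$.

```python
from itertools import product

def rs_generator(k, N, p):
    """k x N generator matrix: row i evaluates x^i at the points 0, 1, ..., N-1 of F_p."""
    return [[pow(a, i, p) for a in range(N)] for i in range(k)]

def min_hamming_distance(G, p):
    """Minimum weight over all non-zero codewords (exhaustive, small parameters only)."""
    k = len(G)
    best = len(G[0])
    for msg in product(range(p), repeat=k):
        if any(msg):
            weight = sum(
                1 for col in zip(*G) if sum(m * g for m, g in zip(msg, col)) % p
            )
            best = min(best, weight)
    return best

p, kappa, delta = 7, 3, 1       # hypothetical parameters
N = kappa + 2 * delta           # target length kappa + 2*delta
G = rs_generator(kappa, N, p)
print(N, min_hamming_distance(G, p))
```

Composing such a generator matrix with an optimal index code matrix, as in the proof of the $\kappa$-bound, would give an ECIC meeting the Singleton bound.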


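As usual, we let $V_q(n, r)$ be the size of a Hamming sphere of radius $r$ in $\mathbb{F}_q^n$. We have the following generalization of [11, Theorem 6.1].

Theorem IV.12. Let $\mathcal{I}$ be an instance of the ICCSI problem. Let $L \in \mathbb{F}_q^{N \times d_S}$ be selected uniformly at random over $\mathbb{F}_q$. The probability that $L$ corresponds to a Hamming metric $\mathbb{F}_q$-linear $(\mathcal{I}, \delta)$-ECIC is at least

$$1 - \sum_{i \in [m]} q^{n - d_i - N - 1}(q - 1) V_q(N, 2\delta).$$

In particular there exists an $\mathbb{F}_q$-linear $(\mathcal{I}, \delta)$-ECIC of length $N$ if

$$N > n - d - 1 + \log_q\big(m(q - 1)V_q(N, 2\delta)\big),$$

where $d = \min\{d_i : i \in [m]\}$.

Proof: Let $L$ be selected uniformly at random in $\mathbb{F}_q^{N \times d_S}$. If $w(LV^{(S)}Z) \leq 2\delta$ for some $Z$ in $\mathcal{Z}^{(i)}$ then $L$ is not $\delta$-error correcting at the $i$th decoder. Since $|\mathcal{Z}^{(i)}| = q^{n - d_i - 1}(q - 1)$, the probability of this occurring at the $i$th receiver is upper bounded by

$$\frac{|\mathcal{Z}^{(i)}| \, V_q(N, 2\delta)}{q^N} = q^{n - d_i - N - 1}(q - 1)V_q(N, 2\delta),$$

so from the union bound the probability of this occurring at some $i$th decoder is at most

$$\sum_{i \in [m]} q^{n - d_i - N - 1}(q - 1)V_q(N, 2\delta) \leq m \, q^{n - d - N - 1}(q - 1)V_q(N, 2\delta).$$
■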
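The existence condition of Theorem IV.12 is implicit in $N$, so in practice one searches for the smallest $N$ at which the union bound drops below 1. The sketch below assumes the per-user failure bound $q^{n-d_i-1}(q-1)V_q(N, 2\delta)/q^N$ and uses the exact sum over users rather than the cruder estimate with $d = \min d_i$; function names and the example parameters are illustrative.

```python
from fractions import Fraction
from math import comb

def hamming_ball(q, N, r):
    """Size V_q(N, r) of a Hamming sphere of radius r in F_q^N."""
    return sum(comb(N, j) * (q - 1) ** j for j in range(min(r, N) + 1))

def failure_bound(q, N, n, dims, delta):
    """Union bound on the probability that a random L fails at some receiver."""
    ball = hamming_ball(q, N, 2 * delta)
    return sum(Fraction(q ** (n - d - 1) * (q - 1) * ball, q ** N) for d in dims)

def smallest_guaranteed_length(q, n, dims, delta, N_max=200):
    """Least N <= N_max for which the bound certifies existence, else None."""
    for N in range(1, N_max + 1):
        if failure_bound(q, N, n, dims, delta) < 1:
            return N
    return None

# Hypothetical parameters: q = 2, n = 5, six users with d_i = 2, one error.
print(smallest_guaranteed_length(2, 5, [2] * 6, 1))
```

The returned length is only what the random-coding argument certifies; shorter codes may exist, as the $\kappa$-bound shows for Example IV.10.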


Remark IV.13. If we let m" denote the number of equivalence classes of [m] under the relation m that i and j are equivalent 
if then in the above we obtain the following refinement: There exists an Vq-linear {X, S)-ECIC of length N if 

N>n — d—1 + \ogq{m"{q — l)Vq{N, 26)), 

where d = minjdi : i G [m]}. 

Let $H_q$ denote the $q$-ary entropy function:

$$H_q : (0, 1) \to \mathbb{R} : x \mapsto x \log_q(q - 1) - x \log_q(x) - (1 - x) \log_q(1 - x).$$

It is well known that the function Hq{x) is continuous and increasing on (0,1 — {1/q)). A proof of the following can be found 
in ESll . 

Lemma IV.14. Let $\lambda \in (0, 1 - (1/q))$ be such that $n\lambda$ is an integer. Then

$$V_q(n, \lambda n) \leq q^{n H_q(\lambda)}.$$
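The entropy estimate for the sphere size can be checked numerically. The sketch below assumes the standard form of the bound, $V_q(n, \lambda n) \leq q^{n H_q(\lambda)}$ for $0 < \lambda < 1 - 1/q$, and compares both sides in floating point for a few illustrative parameter choices.

```python
from math import comb, log

def entropy_q(x, q):
    """q-ary entropy function H_q(x) for 0 < x < 1."""
    return x * log(q - 1, q) - x * log(x, q) - (1 - x) * log(1 - x, q)

def hamming_ball(q, n, r):
    """Size V_q(n, r) of a Hamming sphere of radius r in F_q^n."""
    return sum(comb(n, j) * (q - 1) ** j for j in range(r + 1))

def entropy_bound_holds(q, n, r):
    """Compare V_q(n, r) with q^(n * H_q(r / n)); requires 0 < r/n < 1 - 1/q."""
    lam = r / n
    return hamming_ball(q, n, r) <= q ** (n * entropy_q(lam, q))

for q, n, r in [(2, 10, 3), (3, 9, 3), (2, 20, 5)]:
    print(q, n, r, hamming_ball(q, n, r), entropy_bound_holds(q, n, r))
```

The bound is loose for small $n$ but captures the correct exponential growth rate, which is what Corollary IV.15 relies on.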


Corollary IV.15. Let X be an instance of the ICCSI problem with. Let X G Q such that 0<A<1 — 1/g and let N Gh satisfy 
XN G Z Then, choosing the entries of L G uniformly at random over the field Fg, the probability that L corresponds 

to a Hamming metric Vq-linear {X,6)-LCIC, with 6 = is at least 

1 - (9 - 1) E qN(i-H,{\)) ■ 

iGfh 

In particular there exists an Vq-linear Hamming metric iX, S)-ECIC if 

n . H 

m < ---, 

<7-1 

where d = minjdi : i G [m]}. 

2) Rank Metric $(\mathcal{I}, \delta)$-ECICs: We assume throughout this section that $t > 1$, that $w$ represents the rank weight and that $\mathcal{N}(\mathcal{I}, \delta)$ is the optimal length of an $\mathbb{F}_q$-linear rank metric $(\mathcal{I}, \delta)$-ECIC. Again, we fix some further notation. We let $N(t, \log_q M, d)$ denote the least integer $s$ such that there exists a code in $\mathbb{F}_q^{s \times t}$ of minimum rank distance $d$ and size $M$. We say that an $\mathbb{F}_q$-linear code of dimension $k$ and minimum rank distance $d$ in $\mathbb{F}_q^{n \times t}$ is a rank metric $\mathbb{F}_q$-$[n, k, d]$ code. In analogy with the previous section, we define the set

J{X) ;= {U C F^^‘ ; X - X' e ,for any X,X' € U}. 

and let a(X) := maxjlogq \U\ : U G J'(X)}. 

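Theorem IV.16. Let $\mathcal{I}$ be an instance of the ICCSI problem. Then

$$N(t, \alpha(\mathcal{I}), 2\delta + 1) \leq \mathcal{N}(\mathcal{I}, \delta).$$

Proof: Let $L \in \mathbb{F}_q^{N \times d_S}$ represent an optimal $(\mathcal{I}, \delta)$-ECIC. Let $U \in \mathcal{J}(\mathcal{I})$ and define

$$C_U = \{LV^{(S)}X : X \in U\} \subseteq \mathbb{F}_q^{N \times t}.$$

Then $C_U$ has minimum rank distance $2\delta + 1$ in $\mathbb{F}_q^{N \times t}$ since $w(LV^{(S)}(X - X')) \geq 2\delta + 1$ for any pair $X, X' \in U$. The result follows on choosing $U \in \mathcal{J}(\mathcal{I})$ such that $\log_q |U| = \alpha(\mathcal{I})$. ■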

The rank-distance Singleton bound ifTlIl states that for any code C in F^®^‘ of minimum distance 25 + 1 that 


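$$\log_q |C| \leq \begin{cases} t(N - 2\delta) & \text{if } t \geq N, \\ N(t - 2\delta) & \text{if } t \leq N. \end{cases}$$

Codes that meet this bound are called maximum rank distance (MRD) codes. Combining the $\alpha$-bound and the Singleton bound for rank-metric codes immediately yields the following.

Corollary IV.17. (Singleton bound) Let $\mathcal{I}$ be an instance of the ICCSI problem. Then

$$\mathcal{N}(\mathcal{I}, \delta) \geq \begin{cases} \dfrac{\alpha(\mathcal{I})}{t} + 2\delta & \text{if } t \geq N(t, \alpha(\mathcal{I}), 2\delta + 1), \\[2ex] \dfrac{\alpha(\mathcal{I})}{t - 2\delta} & \text{if } t \leq N(t, \alpha(\mathcal{I}), 2\delta + 1). \end{cases}$$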

We now give a result on the existence of a linear encoding of length $N$ for an $(\mathcal{I}, \delta)$-ECIC, extending Theorem 6.1 in [11]. We let $V_q(N, t, s) := |\{X \in \mathbb{F}_q^{N \times t} : w(X) \leq s\}|$ denote the size of a sphere of rank distance radius $s$ in $\mathbb{F}_q^{N \times t}$.

We will use the following result from IZTl . 

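Theorem IV.18. Let $W$ be an $\mathbb{F}_q$-vector space and let $\mathcal{F}_r$ be a family of $r$-dimensional subspaces of $W$. Let $\mathrm{Hom}_{\mathcal{F}_r}(\mathbb{F}_q^t, W)$ denote the set of homomorphisms of $\mathbb{F}_q^t$ whose images lie in $\mathcal{F}_r$. Then

$$|\mathrm{Hom}_{\mathcal{F}_r}(\mathbb{F}_q^t, W)| = |\mathcal{F}_r| \prod_{j=0}^{r-1} (q^t - q^j).$$

If $W$ is a subspace of $\mathbb{F}_q^s$ then $\mathrm{Hom}_{\mathcal{F}_r}(\mathbb{F}_q^t, W)$ corresponds to the set of all $s \times t$ matrices of rank $r$ whose column spaces lie in $W$.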
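Taking $W = \mathbb{F}_q^s$ and $\mathcal{F}_r$ to be all $r$-dimensional subspaces, Theorem IV.18 gives the number of $s \times t$ matrices of rank $r$, and summing over $r$ gives the size of a rank-metric sphere. The sketch below is an illustration under that reading; the function names are choices made here. As a consistency check, the counts over all ranks must add up to $q^{st}$.

```python
def gaussian_binomial(s, r, q):
    """Number of r-dimensional subspaces of an s-dimensional space over F_q."""
    if r < 0 or r > s:
        return 0
    num = den = 1
    for i in range(r):
        num *= q ** s - q ** i
        den *= q ** r - q ** i
    return num // den

def num_rank_r(s, t, r, q):
    """Number of s x t matrices over F_q of rank exactly r."""
    count = gaussian_binomial(s, r, q)  # choice of column space in F_q^s
    for j in range(r):
        count *= q ** t - q ** j        # surjections of F_q^t onto that space
    return count

def rank_ball(N, t, radius, q):
    """Size of a sphere of rank-distance radius `radius` in F_q^(N x t)."""
    return sum(num_rank_r(N, t, r, q) for r in range(radius + 1))

print([num_rank_r(2, 3, r, 2) for r in range(3)])  # rank distribution of 2 x 3 binary matrices
print(rank_ball(3, 3, 2, 2))
```

With radius $2\delta$ this is the quantity $V_q(N, t, 2\delta)$ that enters the rank-metric existence bound.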

Let m!" denote the number of equivalence classes of [m] under the relation m that i and j are equivalent if 

Theorem IV.19. Let T. be of an instance of an ICCSI problem and let L for some positive integer N. The probability 

that L represents a linear (I, S)-ECIC of length N is at least 


' r— 1 


-- 1 


iGmr>25+l yi—0 j—0 

In particular, there exists such a matrix L if 

i£rhr>2S-\-l \i—0 j—0 


2S r— 1 




^ -i q r—0 j—0 


r 

L J q 


^Nt 


< 


V{N,t,2S)- 


Proof: From Theorem IV.2 the matrix L G represents a linear (I, 5) if and only if for each i G [m], w{LV^^'>Z) > 

2(5-fl for any Z G Z^f^. Therefore, a decoding failure at the ith node occurs if and only if the sphere B 2 s{Z) = {LV^^^ Z+W : 
w{W) < 25} C F^^* contains the zero matrix for some Z G Zg''\ Then the probability of a decoding failure at the ith receiver 
is upper bounded by 




Zf |C(iV,C2(5) 


|F^xt| 

We define the following sets for each non-negative integer r: 


< 


r,Nt 


S-W = {M < : dim M = r} and = {M < C : dim M = r}. 


Then \si"^\ = 




and \Tj-\ = 


di 1 


(i) (i) 

. Now Sr (resp. T} ) is the set of column spaces in F" of all n x f 


matrices in (resp. in of rank r. Then from Theorem 


IV.18 


it follows that 


\zf\ = E (|Hom^u)(F‘,V«^)|-|Hom^(.,(F‘,V«"-ni?.^)|) 

r>2i5-|-l 


r>2i5-|-l 


3=0 


r>25-Fl \i=0 j=0 


Theorem IV.18 can also be applied to obtain 

2(5 2(5 r-1 

V(N,f25) = |Hom,(F‘,Ff )| = E 11(9'^ " 

r—0j—0 


r—0 


r 

L A q 


r 

L Jq 
 t ≥   d ≥   N ≥   δ ≤   m ≤
 6     11    16    1     239
 7     11    15    1     239
 8     12    13    1     239
 11    12    12    1     239
 20    12    11    1     239
 10    11    20    2     239
 11    11    19    2     239
 12    11    18    2     239
 14    11    17    2     239
 17    11    16    2     239
 9     12    19    2     239
 10    12    18    2     239
 11    12    17    2     239
 13    12    16    2     239
 16    12    15    2     239
 16    12    19    3     239
 19    12    18    3     239
 13    13    20    3     239
 14    13    19    3     239
 15    13    18    3     239
 16    13    18    3     239
 17    13    17    3     239
 18    13    18    3     239
 19    13    17    3     239

TABLE IV


Then the probability of a failure at the zth decoder is upper-bounded by 

'r—l r—1 


^-Nt 




26 r-1 


7=0 


<? r=0 j—0 


r 

L Jq 


r>2(5+l \7=0 

The result now follows from the union bound. ■ 

In the error-free case, that is for 5 = 0, Theorem |IV.19 asserts that there exists an N x ds matrix L of rank N representing 


an Fg-linear I-IC whenever 


Moreover 


i^rfi i^rfi 


1) > E 


i—di—N—l)t/ t _ 


(9* - 1) 


i^ni 


where k = max{n — di : i G [m]}. Then for N = k + £, there exists a linear I of length N as long as m'" < /{q* — 1). 

In particular, this shows that: 

Corollary IV.20. Let I be an instance of the of the ICCSI problem and let k = max{n — di : i G [m]}. If m"' < q^*/{q^ — 1) 
then k{I) < k 1 — 1. 


In Table IV we give parameters $t, N, d$ for which the existence of a linear $(\mathcal{I}, \delta)$-ECIC of length $N$ is established by Theorem IV.19 for $n = 20$, $d_i = d$ for each $i \in [m]$.


V. Decoding Index Codes 

Error correction for index codes (as for non-multicast network codes) is non-trivial. We consider two approaches, one for 
rank-metric error correction and the other to correct Hamming metric errors, based on syndrome decoding. 

A. Syndrome Decoding for Hamming Metric Errors 

In EH the authors give a syndrome decoding algorithm for Hamming metric error correction in the ICSI problem. In this 
section we extend the algorithm to the case of ICCSI problem. 

For the remainder of this section, we let $L \in \mathbb{F}_q^{N \times d_S}$ be a matrix corresponding to an $(\mathcal{I}, \delta)$-ECIC. Suppose that for some $i \in [m]$ the $i$th user receives the message

$$Y_{(i)} = LV^{(S)}X + W_{(i)} \in \mathbb{F}_q^{N \times t},$$
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where LV^^'^X is the codeword transmitted by S and is the error vector in F^. Since Ri ^ there exists an invertible 
matrix such that 

= [/| 0 ] where / is the identity matrix in and 

The matrix M(j) may be constructed by the zth user as follows. Choose a right inverse £ jrnxdi+i matrix 

G £ that has the rows of as its first di rows and has Ri in the final row. Then GA^i^ is the identity matrix in 

jp,(d. + l)x(d, + l) 

yW^d.+i = [0,...,0,1]. 

Choose to be an n X (n — di — 1) matrix whose columns form a basis of G^. Then M(^i) = is invertible and 

satisfies = [/| 0 ] and RiMf^i^ = e^^+i. 

Now define X' := M^^X £ F”^‘. Then we have 

vj^'^X = BjM-^X = X' for j £ [d,] 

and 

Ra = 


Lemma V.l. If Z a [/|0]"*~ and Zd^+i 7 ^ 0 


w{LV^^^Mi^^Z) >25 + 1. 

Proof: Let Z £ [/|0]-*' be such that ^di+i f 0. Then = [/|0].Z = 0 and RiM(^i^Z = e^.+iZ = Z^^+i ^ 0 so 


1V.2 


M^i-)Z £ Z^'^f The result now follows from Theorem 

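Let $L' = LV^{(S)}M^{(i)}$ and let $\overline{[s]} := [n] \setminus [s]$. Consider the following two codes. We define $C^{(i)} \subset \mathbb{F}_q^N$ to be the column space of the matrix $L'_{\overline{[d_i]}} \in \mathbb{F}_q^{N \times (n - d_i)}$ and we define $C_{(i)} \subset \mathbb{F}_q^N$ to be the subspace of $C^{(i)}$ spanned by the columns of $L'_{\overline{[d_i+1]}}$.

For each $i \in [m]$, we have $C_{(i)} \subset C^{(i)}$ with $\dim(C^{(i)}) = \dim(C_{(i)}) + 1$. As usual, for an $\mathbb{F}_q$-linear code $C \subseteq \mathbb{F}_q^N$ we write $C^{\perp} := \{y \in \mathbb{F}_q^N : x \cdot y = 0 \text{ for all } x \in C\}$ to denote its dual code. Then we have $C^{(i)\perp} \subset C_{(i)}^{\perp}$ with $r_i = \dim(C_{(i)}^{\perp}) = \dim(C^{(i)\perp}) + 1$ for some $r_i$. Let $H_{(i)}$ be a parity check matrix of $C_{(i)}$ of the form

$$H_{(i)} = \begin{bmatrix} h_{(i)} \\ H^{(i)} \end{bmatrix} \in \mathbb{F}_q^{r_i \times N},$$

where $h_{(i)}$ is the first row of $H_{(i)}$ and the rows of $H^{(i)} \in \mathbb{F}_q^{(r_i - 1) \times N}$ lie in $C^{(i)\perp}$. Then $H^{(i)}$ is a parity check matrix of $C^{(i)}$ and

$$H_{(i)}L'_{d_i+1} = [s_{d_i+1}, 0, \dots, 0]^T \qquad (3)$$

for some $s_{d_i+1} \in \mathbb{F}_q \setminus \{0\}$.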

We now outline a procedure for decoding the demand $R_iX$ at the $i$th receiver, which is based on syndrome decoding. In the first step we compute a syndrome of $H_{(i)}$, in which is embedded a syndrome of $H^{(i)}$. In the second step a table of syndromes is computed for the code $C^{(i)}$. Finally, in the third step the output $R_iX$ is computed.

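Step I: Compute

$$H_{(i)}\bigl(Y_{(i)} - L'_{[d_i]}X'_{[d_i]}\bigr) = \begin{bmatrix} \alpha_i \\ \beta_i \end{bmatrix} \in \mathbb{F}_q^{r_i \times t}.$$

Step II: Find $e \in \mathbb{F}_q^{N \times t}$ with $w(e) \leq \delta$ such that

$$H^{(i)}e = \beta_i. \qquad (4)$$

Step III: Compute

$$\hat{X}_{d_i+1} = (\alpha_i - h_{(i)}e)/s_{d_i+1}. \qquad (5)$$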


Theorem V.1. If $w(W_{(i)}) \leq \delta$ then the procedure above has output $\hat{X}_{d_i+1} = X'_{d_i+1} = R_iX$.

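Proof: We have

$$Y_{(i)} = LV^{(S)}X + W_{(i)} = L'X' + W_{(i)} = L'_{[d_i]}X'_{[d_i]} + L'_{\overline{[d_i]}}X'_{\overline{[d_i]}} + W_{(i)} \qquad (6)$$

and

$$Y_{(i)} - L'_{[d_i]}X'_{[d_i]} = L'_{\overline{[d_i]}}X'_{\overline{[d_i]}} + W_{(i)}.$$

Then

$$H_{(i)}\bigl(Y_{(i)} - L'_{[d_i]}X'_{[d_i]}\bigr) = \begin{bmatrix} \alpha_i \\ \beta_i \end{bmatrix},$$

where $\alpha_i = s_{d_i+1}X'_{d_i+1} + h_{(i)}W_{(i)} \in \mathbb{F}_q^{t}$ and $\beta_i = H^{(i)}W_{(i)} \in \mathbb{F}_q^{(r_i - 1) \times t}$.

Let $e \in \mathbb{F}_q^{N \times t}$ have Hamming weight at most $\delta$, as in Step II. Then $W_{(i)} - e \in C^{(i)}$ and $w(W_{(i)} - e) \leq 2\delta$, which means $W_{(i)} - e \in C_{(i)}$ from Lemma V.1. Therefore,

$$(\alpha_i - h_{(i)}e)/s_{d_i+1} = \bigl(s_{d_i+1}X'_{d_i+1} + h_{(i)}(W_{(i)} - e)\bigr)/s_{d_i+1} = X'_{d_i+1}.$$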


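Example V.3. Let $q = 2$, $m = n = 4$, $t = 1$, $\delta = 1$ and $R_i = e_i$ for all $i \in [4]$ and $\mathcal{X}^{(S)} = \mathbb{F}_q^n$. Assume

$$V^{(1)} = \begin{bmatrix} 0 & 1 & 1 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix}, \quad V^{(2)} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 \end{bmatrix}, \quad V^{(3)} = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad V^{(4)} = \begin{bmatrix} 1 & 1 & 0 & 1 \\ 0 & 1 & 1 & 0 \end{bmatrix}.$$

Then

$$\bigcup_{i \in [4]} \mathcal{Z}^{(i)} = \{[1001], [1000], [0111], [0100], [0010], [0110]\},$$

so that $\alpha(\mathcal{I}) = 2$. In fact $\langle [0100], [0010] \rangle \setminus \{0\} \subset \bigcup_{i \in [4]} \mathcal{Z}^{(i)}$ and from the $\alpha$-bound we have $5 = N(2,3) \leq \mathcal{N}(\mathcal{I}, 1)$. It can be checked that $\kappa(\mathcal{I}) = 2$, and so from the $\kappa$-bound we have $5 = N(2,3) \geq \mathcal{N}(\mathcal{I}, 1)$. Then

$$L = \begin{bmatrix} 1 & 0 & 1 & 0 \\ 0 & 1 & 1 & 1 \\ 1 & 1 & 0 & 0 \\ 1 & 1 & 1 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix}$$

represents an optimal linear $(\mathcal{I}, 1)$-ECIC.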
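The claims in this example can be checked by brute force. The sketch below, under the assumption that $\mathcal{Z}^{(i)}$ is the set of $Z \in \mathbb{F}_2^4$ with $V^{(i)}Z = 0$ and $R_iZ \neq 0$ (as used in the proof of Lemma V.1), enumerates the union of these sets and verifies the weight condition $w(LZ) \geq 2\delta + 1$. The matrix entries are the ones read from the example; the variable and function names are illustrative.

```python
from itertools import product


def dot(u, v):
    """Inner product over GF(2)."""
    return sum(a & b for a, b in zip(u, v)) % 2


# Side information matrices V^(i) and requests R_i = e_i of the example.
V = {
    1: [[0, 1, 1, 0], [0, 0, 1, 0]],
    2: [[1, 0, 0, 0], [0, 0, 1, 1]],
    3: [[1, 0, 0, 0], [0, 0, 0, 1]],
    4: [[1, 1, 0, 1], [0, 1, 1, 0]],
}
R = {i: [int(j == i - 1) for j in range(4)] for i in range(1, 5)}
L = [
    [1, 0, 1, 0],
    [0, 1, 1, 1],
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 0],
]
delta = 1


def z_set(i):
    """All Z with V^(i) Z = 0 and R_i Z != 0 (assumed definition of Z^(i))."""
    return [z for z in product((0, 1), repeat=4)
            if all(dot(row, z) == 0 for row in V[i]) and dot(R[i], z) == 1]


union = {z for i in V for z in z_set(i)}
# Hamming weight of L*Z, minimised over the union of the Z^(i).
min_weight = min(sum(dot(row, z) for row in L) for z in union)
print(sorted(union), min_weight)
```

With these inputs the union has six elements and the minimum weight equals $2\delta + 1 = 3$, which is consistent with $L$ correcting one Hamming error for every user.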

Let $X = [1111]^T$. The sender broadcasts $LX$. Suppose one Hamming error occurs and $U_4$ receives the vector $Y_4 = LX + W_{(4)}$, where

$$W_{(4)} = [00010]^T.$$

Then

$$Y_4 = [01001]^T.$$


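Let

$$M^{(4)} = \begin{bmatrix} 1 & 0 & 1 & 1 \\ 0 & 0 & 0 & 1 \\ 0 & 1 & 0 & 1 \\ 0 & 0 & 1 & 0 \end{bmatrix} \quad \text{and} \quad L' = LM^{(4)} = \begin{bmatrix} 1 & 1 & 1 & 0 \\ 0 & 1 & 1 & 0 \\ 1 & 0 & 1 & 0 \\ 1 & 1 & 1 & 1 \\ 0 & 1 & 0 & 1 \end{bmatrix}.$$

We obtain a parity check matrix of $C_{(4)}$, as in (3):

$$H_{(4)} = \begin{bmatrix} 0 & 0 & 0 & 1 & 1 \\ 1 & 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 & 1 \\ 0 & 0 & 1 & 1 & 1 \end{bmatrix}.$$

Applying Step I of our decoding algorithm we obtain

$$H_{(4)}\bigl(Y_4 - L'_{[2]}X'_{[2]}\bigr) = \begin{bmatrix} 0 & 0 & 0 & 1 & 1 \\ 1 & 0 & 0 & 1 & 1 \\ 0 & 1 & 0 & 1 & 1 \\ 0 & 0 & 1 & 1 & 1 \end{bmatrix} \begin{bmatrix} 1 \\ 1 \\ 1 \\ 1 \\ 1 \end{bmatrix} = \begin{bmatrix} 0 \\ 1 \\ 1 \\ 1 \end{bmatrix}.$$

Therefore $\alpha_4 = 0$ and $\beta_4 = [111]^T$. Now from Step II, we obtain that the vector $e = [00001]^T$ is a solution of $H^{(4)}e = \beta_4$, and in Step III we obtain

$$X'_3 = (0 - [00011] \cdot [00001])/1 = 1 = X_4.$$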
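The three decoding steps can be replayed on this example with a few lines of Python. This is a sketch under stated assumptions rather than a general implementation: it works over $\mathbb{F}_2$ with $t = 1$ (so division by $s_{d_i+1} = 1$ is trivial), it takes the first row of $H_{(4)}$ as $h_{(4)}$ and the remaining rows as $H^{(4)}$, and it replaces the syndrome table of Step II by an exhaustive search over error patterns of weight at most $\delta$. The function names `step_one`, `step_two` and `step_three` are illustrative.

```python
from itertools import combinations


def matvec(A, x):
    """Matrix-vector product over GF(2)."""
    return [sum(a & b for a, b in zip(row, x)) % 2 for row in A]


# Data of the example for user 4 (d_4 = 2, N = 5, delta = 1).
H_full = [[0, 0, 0, 1, 1],
          [1, 0, 0, 1, 1],
          [0, 1, 0, 1, 1],
          [0, 0, 1, 1, 1]]
h_row, H_sup = H_full[0], H_full[1:]       # h_(4) and H^(4)
L_prime = [[1, 1, 1, 0],
           [0, 1, 1, 0],
           [1, 0, 1, 0],
           [1, 1, 1, 1],
           [0, 1, 0, 1]]
Y = [0, 1, 0, 0, 1]                         # received word Y_4
side_info = [1, 0]                          # X'_1, X'_2 = V^(4) X
delta = 1


def step_one(H_full, L_prime, Y, side_info):
    """Syndrome of Y minus the contribution of the known coordinates."""
    d = len(side_info)
    known = matvec([row[:d] for row in L_prime], side_info)
    reduced = [y ^ k for y, k in zip(Y, known)]
    syndrome = matvec(H_full, reduced)
    return syndrome[0], syndrome[1:]        # alpha, beta


def step_two(H_sup, beta, delta):
    """Exhaustive search for e with w(e) <= delta and H^(i) e = beta."""
    N = len(H_sup[0])
    for weight in range(delta + 1):
        for support in combinations(range(N), weight):
            e = [int(j in support) for j in range(N)]
            if matvec(H_sup, e) == beta:
                return e
    return None


def step_three(alpha, h_row, e):
    """(alpha - h e) / s with s = 1; subtraction is XOR over GF(2)."""
    return alpha ^ matvec([h_row], e)[0]


alpha, beta = step_one(H_full, L_prime, Y, side_info)
e = step_two(H_sup, beta, delta)
x_hat = step_three(alpha, h_row, e)
print(alpha, beta, e, x_hat)
```

The search may return a different weight-one vector than the one quoted in the example, since $H^{(4)}$ has two equal columns; the output of Step III is the same for either choice, as the proof of Theorem V.1 predicts.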


Remark V.4. The decoding procedure outlined above extends that of [11, Section VII] to the ICCSI case, firstly via the use of the matrices $M^{(i)}$, which transform it to an ICSI problem for the user $i$. However, Step III of our algorithm diverges from that one in a different sense: in [11] it is required to solve the system $Y_i = L\tilde{X} - e$ given $\tilde{X}$ whose coordinates agree with those of $X$ in the side-information of the $i$th user, for any decoding. In our case a pre-computation is performed to determine $h_{(i)}$ as in (3) and $s_{d_i+1}$. We do this by solving the system

$$h_{(i)} \bigl[ L'_{d_i+1} \mid L'_{\overline{[d_i+1]}} \bigr] = [1, 0, \dots, 0].$$

The computational complexities of both algorithms are similar, being dominated by Step II, where a low weight element of a coset of $C^{(i)}$ must be found.


B. Decoding for Rank-Metric Errors 

In the model presented here, we assume that a matrix $Y$ is transmitted and that at any given receiver, a matrix of the form $Y + W$ is received. Therefore the decoding algorithm of the additive matrix channel as described in [35] may be considered. We do not in fact necessarily assume that $L$ represents an $(\mathcal{I}, \delta)$-ECIC. Instead we add redundancy by embedding the broadcast $LV^{(S)}X$ into a larger matrix with zeroes off the entries assigned to $LV^{(S)}X$. In this scheme, it is assumed that up to $r$ packets are randomly injected into the network, in the form of a matrix of rank $r$. It can be assumed that with high probability, the first $r$ rows of the error matrix are linearly independent.

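Given an $N \times d_S$ matrix $L$ over $\mathbb{F}_q$ for an $\mathcal{I}$-IC, each $i$th receiver requires $LV^{(S)}$ and $LV^{(S)}X$ in order to retrieve its requested data $R_iX$. Employing the method of [35], we let

$$P = \begin{bmatrix} 0_{v \times v} & 0_{v \times \ell} \\ 0_{N \times v} & Q \end{bmatrix},$$

where $Q = LV^{(S)}X \in \mathbb{F}_q^{N \times t}$ and $\ell = t$ if $LV^{(S)}$ is known to each receiver, and $Q = [LV^{(S)} \mid LV^{(S)}X] \in \mathbb{F}_q^{N \times \ell}$ and $\ell = d_S + t$ if $LV^{(S)}$ is not known to all receivers.

Given an error matrix $W$ of rank $r \leq v$, we write

$$W = \begin{bmatrix} W_{11} & W_{12} \\ W_{21} & W_{22} \end{bmatrix},$$

with $W_{11} \in \mathbb{F}_q^{v \times v}$, $W_{21} \in \mathbb{F}_q^{N \times v}$, $W_{12} \in \mathbb{F}_q^{v \times \ell}$, $W_{22} \in \mathbb{F}_q^{N \times \ell}$. If $W_{11}$ has rank $r$ then

$$r = \mathrm{rank}(W_{11}) \leq \mathrm{rank}\begin{bmatrix} W_{11} \\ W_{21} \end{bmatrix} \leq \mathrm{rank}(W) = r,$$

so the rows of $W_{21}$ are contained in the row space of $W_{11}$. Therefore, $TW_{11} = W_{21}$ for some $T \in \mathbb{F}_q^{N \times v}$. Then

$$r = \mathrm{rank}(W) = \mathrm{rank}(W_{11}) + \mathrm{rank}(TW_{12} - W_{22}) = r + \mathrm{rank}(TW_{12} - W_{22}),$$

so we must have $TW_{12} = W_{22}$. The matrix $T$ can be easily computed, since the submatrices $W_{11}, W_{21}$ are known to each receiver. Moreover, since $W_{12}$ is known, the decoder retrieves $Q = -TW_{12} + (W_{22} + Q)$.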
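A minimal sketch of this error-trapping recovery over $\mathbb{F}_2$ follows. It assumes the receiver holds the full matrix $P + W$ and knows $v$; the function names (`solve_gf2`, `trap_decode`) and the sample matrix are illustrative assumptions. The routine returns `None` when the rows of $W_{21}$ are not in the row space of $W_{11}$, which is the case where the failure of error-trapping can be detected.

```python
def transpose(A):
    return [list(col) for col in zip(*A)]


def matmul(A, B):
    """Matrix product over GF(2)."""
    return [[sum(a & b for a, b in zip(row, col)) % 2 for col in zip(*B)]
            for row in A]


def solve_gf2(A, B):
    """Solve A*X = B over GF(2); free variables are set to 0.

    Returns None if the system is inconsistent.
    """
    rows, cols = len(A), len(A[0])
    aug = [A[i][:] + B[i][:] for i in range(rows)]
    pivots, r = [], 0
    for c in range(cols):
        piv = next((i for i in range(r, rows) if aug[i][c]), None)
        if piv is None:
            continue
        aug[r], aug[piv] = aug[piv], aug[r]
        for i in range(rows):
            if i != r and aug[i][c]:
                aug[i] = [a ^ b for a, b in zip(aug[i], aug[r])]
        pivots.append(c)
        r += 1
    if any(any(row[cols:]) for row in aug[r:]):
        return None
    X = [[0] * len(B[0]) for _ in range(cols)]
    for k, c in enumerate(pivots):
        X[c] = aug[k][cols:]
    return X


def trap_decode(received, v):
    """Recover Q from P + W, or None if trapping is detected to have failed."""
    top, bottom = received[:v], received[v:]
    W11 = [row[:v] for row in top]
    W12 = [row[v:] for row in top]
    W21 = [row[:v] for row in bottom]
    Q_noisy = [row[v:] for row in bottom]          # W22 + Q
    # T*W11 = W21 is equivalent to W11^T * T^T = W21^T.
    Tt = solve_gf2(transpose(W11), transpose(W21))
    if Tt is None:
        return None
    TW12 = matmul(transpose(Tt), W12)
    # Over GF(2), subtraction coincides with XOR.
    return [[a ^ b for a, b in zip(r1, r2)] for r1, r2 in zip(Q_noisy, TW12)]


# Hypothetical received matrix with v = 2, N = 3, l = 1.
received = [[1, 0, 1],
            [1, 1, 0],
            [0, 0, 1],
            [0, 1, 0],
            [1, 0, 0]]
Q = trap_decode(received, 2)
print(Q)
```

An undetected failure, where $W_{21}$ lies in the row space of $W_{11}$ but $\mathrm{rank}(W)$ exceeds $\mathrm{rank}(W_{11})$, cannot be caught by this routine and yields a wrong $Q$.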

From Lemma III.5 the matrix $L$ represents an $\mathbb{F}_q$-linear $\mathcal{I}$-IC if and only if for each $i \in [m]$ there exist vectors $U \in \mathbb{F}_q^{n}$, $A \in \mathbb{F}_q^{d_i}$ and $B \in \mathbb{F}_q^{N}$ such that

$$R_i = -U + BLV^{(S)} \quad \text{and} \quad U = AV^{(i)}.$$

Once $LV^{(S)}$ and $LV^{(S)}X$ are known at the $i$th receiver, its requested data $R_iX$ can be computed as follows.

1) Choose $U \in \mathcal{X}^{(i)}$. Equivalently, choose $A \in \mathbb{F}_q^{d_i}$ and write $U = AV^{(i)}$.

2) Solve $R_i + AV^{(i)} = BLV^{(S)}$ for some $B \in \mathbb{F}_q^{N}$.

3) Compute $R_iX = BY - A\Lambda^{(i)}$, where $Y = LV^{(S)}X$ and $\Lambda^{(i)} = V^{(i)}X$.

In practice, the decoder computes $[S \mid T]$, the reduced row echelon form of the matrix

$$\begin{bmatrix} V^{(i)} & \Lambda^{(i)} \\ LV^{(S)} & Y \end{bmatrix},$$

and solves for $Z$ in $ZS = R_i$ to retrieve $R_iX = ZT$. In particular, if $R_i = e_j$ for some $j \in [n]$, then $R_i$ already appears as a row of $S$, and the corresponding row of $T$ gives the required vector.
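The row-reduction step can be sketched as follows over $\mathbb{F}_2$. The assumption made here is that the matrix being reduced stacks the user's side information $[V^{(i)} \mid V^{(i)}X]$ on top of $[LV^{(S)} \mid Y]$; the function names (`rref_gf2`, `retrieve`) and the toy instance are illustrative. Because $S$ is in reduced row echelon form, a vector in its row space is the sum of exactly those pivot rows whose pivot column carries a 1 in that vector, which gives $Z$ without a separate solve.

```python
def rref_gf2(M, ncols):
    """Row-reduce M over GF(2), pivoting only in the first ncols columns.

    Returns the nonzero reduced rows and the list of pivot columns.
    """
    M = [row[:] for row in M]
    pivots, r = [], 0
    for c in range(ncols):
        piv = next((i for i in range(r, len(M)) if M[i][c]), None)
        if piv is None:
            continue
        M[r], M[piv] = M[piv], M[r]
        for i in range(len(M)):
            if i != r and M[i][c]:
                M[i] = [a ^ b for a, b in zip(M[i], M[r])]
        pivots.append(c)
        r += 1
    return M[:r], pivots


def retrieve(V, Lam, LV, Y, R):
    """Return R*X from the stacked matrix, or None if R is not in its row space."""
    n = len(R)
    stacked = ([v + lam for v, lam in zip(V, Lam)]
               + [l + y for l, y in zip(LV, Y)])
    reduced, pivots = rref_gf2(stacked, n)
    acc = [0] * len(stacked[0])
    for row, c in zip(reduced, pivots):
        if R[c]:                      # this pivot row is needed to build R
            acc = [a ^ b for a, b in zip(acc, row)]
    if acc[:n] != list(R):
        return None
    return acc[n:]                    # the T-part, equal to R*X


# Toy instance (hypothetical): n = 4, t = 1, X = [1, 0, 1, 0]^T.
LV = [[1, 1, 0, 0], [0, 1, 1, 0], [0, 0, 1, 1]]
Y = [[1], [1], [1]]                   # LV * X
V4, Lam4 = [[1, 0, 0, 0]], [[1]]      # user knows X_1 = 1
R4 = [0, 0, 0, 1]                     # user wants X_4
print(retrieve(V4, Lam4, LV, Y, R4))
```

Returning `None` when the reconstructed left part differs from $R_i$ makes explicit the case where the request is not decodable from the available rows.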

Note that the method of [35] assumes that the error matrix $W$ has its first $r$ rows linearly independent. This assumption is referred to by the authors as error-trapping. In the event that

$$\mathrm{rank}(W_{11}) < \mathrm{rank}\begin{bmatrix} W_{11} \\ W_{21} \end{bmatrix},$$

the decoder detects that error-trapping has failed to occur. If

$$\mathrm{rank}(W_{11}) = \mathrm{rank}\begin{bmatrix} W_{11} \\ W_{21} \end{bmatrix} < \mathrm{rank}(W),$$

the decoder does not detect that error-trapping has failed, so a decoding failure will occur. As noted in [35], the probability of this event can be bounded from above.

Remark V.5. For the case $t > 1$, if $L$ does represent a rank-metric $(\mathcal{I}, \delta)$-ECIC, and $LV^{(S)}$ is known to each receiver in advance of the transmission, then the sender may broadcast

$$P = \begin{bmatrix} 0_{v \times v} & 0_{v \times t} \\ 0_{N \times v} & LV^{(S)}X \end{bmatrix},$$

while the $i$th client receives the noisy transmission

$$P + W = \begin{bmatrix} W_{11} & W_{12} \\ W_{21} & W_{22} + LV^{(S)}X \end{bmatrix}.$$

If error-trapping has failed and this is detected at the $i$th decoder then, from Theorem IV.2, if $\mathrm{rank}(W_{22}) \leq \delta$ then $R_iX$ is uniquely retrievable from the submatrix $[W_{22} + LV^{(S)}X]$. However, the existence of an efficient algorithm to compute $R_iX$ given the received matrix $[W_{22} + LV^{(S)}X]$ for an arbitrary matrix $L$ representing an $(\mathcal{I}, \delta)$-ECIC is unlikely.

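Example V.6. Let $m = 4$, $n = 4$, $q = 2$ and let $\mathcal{X}^{(S)} = \mathbb{F}_q^n$. Suppose that $\mathcal{X}^{(i)}$ has dimension $d_i = 1$ for each $i \in \{1, \dots, 4\}$. Let $\mathcal{I}$ be the instance defined by user side-information

$$V^{(1)} = [0\ 1\ 0\ 0], \quad V^{(2)} = [0\ 0\ 1\ 0], \quad V^{(3)} = [0\ 0\ 0\ 1], \quad V^{(4)} = [1\ 0\ 0\ 0],$$

and requests

$$R_1 = [1000], \quad R_2 = [0100], \quad R_3 = [0010], \quad R_4 = [0001].$$

It can be checked that $\kappa(\mathcal{I}) = 3$ and that

$$L = \begin{bmatrix} 1 & 1 & 0 & 0 \\ 0 & 1 & 1 & 0 \\ 0 & 0 & 1 & 1 \end{bmatrix}$$

is a matrix corresponding to an $\mathcal{I}$-IC. Let $v = 2$, $t = 1$, $X = [1010]^T$ and suppose that $L$ is known to the receivers. Then the matrix $P$ is given by

$$P = \begin{bmatrix} 0_{2 \times 2} & 0_{2 \times 1} \\ 0_{3 \times 2} & LX \end{bmatrix}.$$

Now, suppose a user receives the matrix

$$P + W = \begin{bmatrix} 1 & 0 & 1 \\ 1 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{bmatrix}, \quad \text{which implies} \quad W = \begin{bmatrix} 1 & 0 & 1 \\ 1 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 1 & 1 \\ 1 & 0 & 1 \end{bmatrix}.$$

Clearly, the error-trapping has succeeded, since the $2 \times 2$ upper left-hand submatrix of $W$ has rank 2. Then we have

$$W_{11} = \begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix}, \quad W_{21} = \begin{bmatrix} 0 & 0 \\ 0 & 1 \\ 1 & 0 \end{bmatrix},$$

from which we determine

$$T = \begin{bmatrix} 0 & 0 \\ 1 & 1 \\ 1 & 0 \end{bmatrix}.$$

From the bottom right-hand part of $P + W$ we compute

$$\begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix} - TW_{12} = \begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix} - \begin{bmatrix} 0 & 0 \\ 1 & 1 \\ 1 & 0 \end{bmatrix} \begin{bmatrix} 1 \\ 0 \end{bmatrix} = \begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix}.$$

At this point the client can decode its demanded message.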


VI. Conclusions 

Permitting coded side information in the index coding problem offers more potential for applications. While the connections to graphs and hypergraphs are no longer apparent as in the classical case, many of the associated combinatorial characterisations have algebraic interpretations. The ICCSI problem in some sense may be viewed as a $q$-analogue of the ICSI problem: the index set of size $d_i$ of the side information of the $i$th user is now replaced by a vector space of dimension $d_i$. This viewpoint means that most bounds on the optimal length of an index code, both for noiseless and noisy channels, have analogues in the more general setting of the ICCSI problem. Although it is likely that error-correcting decoding schemes for multicast network coding can be adapted for their index coding equivalents, the design of efficient error-correcting decoding algorithms for index codes remains a challenging problem.